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Haplotvoes and Polymorphism* Voiced to hum an ThiopnriBe S-Methyltransferase 
Deficiencies 

Description 

BACKGROUND OF THE INVENTION 
5 Field of the Invention 

The present invention is in the field of cancer and immunosuppressive therapeutics, diagnostics, 
and drug metabolism. In particular, the present invention relates to characterization of the genetic 
basis for thiopurine mefoyltnmsferase deficiency, A number of single nucleotide polymorphisms 
are, at least in part, responsible for severe hematopoietic toxicity in cancer, Crohn's disease, 
10 autoimmune diseases pike rheumatoid arftdti's or lupus eryihematodes), multiple sclerosis or' 
organ transplant recipient patients who are treated with standard dosages of 6~mercaptopurine, 6- 
thioguanine or azathioprine (fhiopurines in general or other drugs that are substrates of the TPMT 
enzyme), 

Related Art 

15 Thiopurine methyltransferase (TPMT, B.C. ZJ,1.67) is a cytoplasmic enzyme that preferentially 
catalyzes the S~methylation of aromatic and heterocyclic' sulfhydryl compounds, including the 
anticancer agents 6~mercaptopurine (6MP) and 6«thioguanine, and the immunosuppressant 
azafcioprine collectively termed as thiopurines, TPMT activity exhibits genetic polymorphism, 
with approximately 90 % of Caucasians and African-Americans having high TPMT activity, 10 % 

20 intermediate activity . (due to heterozygosity), and 03 % inheriting TPMT-deficiency as an 
autosomal recessive trait (Weinshilboum, R. M. and Sladek, S. L., Am, J, Hum Genet, 32:651- 
662 (1980); McLeod, H, L. et al„ Clin, Pharmacol Thar. 55:15-20 (1994)). TPMT activity can be 
measured in erythrocytes, as the level of TPMT activity in human liver, kidney, lymphocytes and 
leukemic lymphoblast correlates with that in erythrocytes (Van Loon, J. A. and Weinshilboum, TL 

25 ML, Biochem. Genet 20:637-658 (1982); .SaumlansM, C. L. 9 et at, Pharmacogenetics 2:148459 
(1992); McLeod, EL L> et aL, Blood 85:18974902(1995)). 

Mercaptopurine, thioguanine, and azathioprine'are prodrugs with no intrinsic activity, requiring 
intracellular 'conversion to thioguanine nucleotides (TGN), with subsequent incorporation into 
DNA, as one mechanism of their antiproliferative effect (Lermard, L„ Eur. J. Clin. Pharmacol 
30 43:329-339 (1992)). Alternatively, these drugs are metabolized to 6-me&yl-mercaptopurine 
(MeMP) or 6-methyl-thioguanine (MeTG) by TPMT or to 6-fhiouric acid (6TU) by xan&ine 
oxidase; MeMP, MeTG, and 6TU are inactive metabolites. Thus, metabolism of 6MP, 
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azaihioprine, or Moguanine by TTMT shunts drug away from the TGN activation pathway. • 
Oinical studies with 6MP and azathioprine have established an inverse correlation between 
• erythrocyte TPMT activity and erythrocyte TGN accumulation, indicating that patients who less 
efficiently methylate these thiopurjnes have more extensive conversion to thioguanine nucleotides 

5 (Lennard, L., et al., Lancet 336:225-229 (1990); Lennard, L. et al., Clin. Pharmacol. Ther. 46:149- 
154 (1989)). Moreover, patients with TPMT deficiency accumulate significantly higher 
erythrocyte TGN if treated with standard dosages of 6MP or azathioprine, leading to severe 
hematopoietic' toxicity, unless the tMopurine dosage is lowered substantially (e.g. 8-15 fold 
reduction) (Evans, W. E., et ah, J. Pediatr. 19:985-989 (1991); McLeod, H. L., et al., Lancet 

10 341:1151 (1993); Lennard, L., et al., Arch. Dis; Child. 69:577-579 (1993)) or me converting 
enzyme hypoxantmne-guanine phosphoribosyltransferase (HGPRT) has a functional defect due to 
SNPs in promoter, splice or coding regions thatreduce activity of HGPRT and by this reduces 
amounts of thiopurines in the body. The majority of such patients are identified only after 
experiencing severe toxicity, even though prospective measurement of erythrocyte TPMT activity 

15 has been advocated by some (Lennard, L. etaL, aim Pharmacol. Ther. 41:18-25 (1987)). 

Unfortunately, TPMT assays are not widely available and newly diagnosed patients with leukemia 
or organ transplant recipients are frequently given erythrocyte transfusions, precluding measure- 
ment of their constitutive TPMT activity before miopurine therapy is initiated. Alternatively, 
several mutant alleles responsible for TPMT deficiency have been described and the relationship 
20 between TPMT geno- and phenotype has been most clearly defined for the clinically relevant 
TPMT alleles *2, *3A and *3C (represented in this file by reference SNPs 44, 47, and 50 
respectively) in patien'te and healthy subjects (Evans et al. J. Clin. Oncol. 19 (2001), 2293-2301, 
Evans et al. U.S. Patent 5,856,095). Whereupon the heterocygote form of the SNPs correlate to a 
deficient TPMT activity (reduced activity) and the mutant form of the three SNPs correlate to a 
25 more deficient TPMT activity (very reduced or very low activity, sometimes absent activity). 
Although the several mutant alleles are known to be associated with intermediate or low activity, 
molecular diagnosis by genotyoping can predict the TPMT phenotype only to 85-95 % (McLeod, 
Leukemia 14 (2000), 567-572; Yates, Ann. Intern. Med. 126 (1997), 608-614). 

A further relationship between genotype and phenotype was found in differences in the variable 
30 number of tandem repeats (VNTR) within the 5' untranslated region of the TPMT gene (Alves, S. 
et al., Clinical Pharmacology and Therapeutics 70 (2001), 165-174. The VNTR is composed of 3 
repeat elements A, B, and C, differing in length of the unit core (17 or 18 bp) and in nucleotide 
sequence. Repeats A and B usually can be repeated in the VNTR 1-6 times, repeat C usually is 
present only ones in the VNTR Depending on the number of repeats the expression rate of the 
35 TPMT protein differs. There seems to be an inverse correlation between the sum of the number of 
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repeats and fee VNTR and the level of TPMT activity but this correlation, is not very strong and • 
not well studied. 

Thus, means and methods for diagnosing and treating diseases, drag responses and disorders based 
on. dysfunction or dysregulations of TPMT are not reliably available yet and lack fee needed 
5 sensitivity and specificity of a diagnostic test Thus, the technical problem underlying the present 
invention is to comply with the above-specified needs. 

Identification of the hare described single nucleotide polymorphisms at the TPMT locus together 
with' the here disclosed algorithm for combining the respective genotypes of several single 
nucleotide polymorphism in a patient to one distinct information about the TPMT phenotype 
10 would enable a treating physician to prospectively identify TPMT-deficient patients based on then- 
genotype, prior to treatment with potentially toxic 'dosages "of thiopurines like mercaptopurine, 
azathioprine or fcioguanine. 

SUMMARY OF THE INVENTION . 

The invention relates to the discovery of single nucleotide polymorphisms in the TPMT gene 
15 together with an algorithm that can predict TPMT enzyme deficiencies. The presence of these 
mutant alleles is directly correlated with potentially fatal hematopoietic toxicity when patients are 
treated with standard dosages of mercaptopurine, azathioprine, or fbioguanine. 

t. 

Based on the discovery- of these single nucleotide polymorphisms together with an algorithm, 
' methods have been developed for detecting these inactivating mutations in genomic DNA isolated 
20 from individual patients (subjects), to make a diagnosis of TPMT-deficiency, or to identify 
heterozygous individuals (i.e., people with one mutant gene and one normal gene), having reduced 
or total deficient TPMT activity. The present invention, therefore,. provides a diagnostic test to 
identify patients wife reduced TPMT activity based on their genotype. Such diagnostic test to 
determine TPMT genotype of patients is quite advantageous because measuring a patient's TPMT 
25 enzyme activity has many limitations. Based on this information, we identified here a set of single 
nucleotide polymorphisms feat are new in this combination. Together wife a newly developed 
algorithm these SNPs are able to predict TPMT activity. These tests involve PCR-based 
amplification of a region of fee TPMT gene where the single nucleotide polymorphisms of 
interests are found. Following' amplification, fee amplified fragment is assayed for fee presence or 
30 absence of the specific single nucleotide polymorphisms of interest Although much of these 
assays can be done "by hand", e.g. sequencing oligonucleotide PCR primers, using a feermocycler 
and protocol to assay for fee presence or absence of a single nucleotide polymorphism, automated 
procedures and kits are designed that contain all the reagents, primers, solutions, et cetera for the 
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genotyping test to facilitate the procedure for use in general clinical laboratories such as those 1 
found in a typical hospital, clinic or commercial reference labs* 

A preferred embodiment of the present invention relates to the presence of a highly homologues 
pseiidogene in the human genome. Whenever primers were designed to be allele-specific for the 

5 TPMT gene we compared both sequences (TPMT gene and pseudogene with bioinformatics 
programs like M£gAlign™ (from DNA Star or other programs) to identify sequences that are 
unique to the TPMT gene, These are for example the introns of the TPMT gene where allele 
specific primers for the TPMT gene can be located. For the few differences between the exons of 
the TPMT gene and the pseudogene primers are located in such a way that the 3 * part of the primer 

10 ends exactly on the TPMT gene where there is a difference between the two genes. 

In particular, the invention relates to isolated polynucleotide naoJecules comprising one or more 
mutant alleles of thiopurine S-meihyltransferase (TPMT) or a fragment thereof, which is at least 
ten consecutive bases long and contains one or more single nucleotide polymoiphisms. The single 
nucleotide polymorphisms are summarised in Table 1 , 

15 An aspect of the invention relates to polynucleotide molecules complementary to any one of the 
polynucleotide molecules described above, 

' A different aspect of the invention relates to a diagnostic assay for determining thiopurine S- 
methyl-transferase (TPMT) genotype of a person which comprises isolating nucleic acid from saJd J 
person, amplifying for a thiopurine S-methyltransferase (TPMT) PGR fragment from said nucleic 
20 t acid, which includes at least one preferably two or three and in an other aspect more than three of 
SNPs 141 of Table 1, thereby obtaining an amplified fragment. The size of the amplified fragment 
needs only be large enough so that it is detectable and useful for the genotyping methods described 
in this file, A preferred range of the amplified fragment size is from 14 nucleotides to several 
hundreds, more preferably from 75 to 400, and most preferably from 80 to 260, 

25 A further aspect of fee invention relates to an isolated polynucleotide molecule having one, two or 
more SNPs on one or more fragments. Moreover, the invention relates to an isolated poly- 
nucleotide molecule complementary to the polynucleotide molecules having a sequence of SNPs 1- 
41 of Table L 

An other preferred aspect of the invention relates to genotyping of the amplified fragments with 
30 methods described in this file but are not limited to these examples. 

An other preferred aspect of the invention is to sequence the VNTR region to identify the number 
■ of A, B, and C repeats that correlate to TPMT activity. • , 
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Yet another aspect of fee invention combines information about the TPMT genotype and the • 
HGPRT genotypes. As inactivating SNPs of the HGPRT gene will produce less, no or deficient 
HGPRT enzyme, there will be less toxic intermediates produced when a patient is under feiopurine 
therapy and the treating physician could adjust the dosage of iMopurines in a therapy scheme more 
5 precisely. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The autosomal recessive trait of thiopurine S-methyltransferase (TPMT) deficiency is associated 
with potentially fatal hematopoietic toxicity when patients are treated with standard dosages of 
mercaptopurine, azafeioprine or mioguanine (feiopurines in general or other drugs mat are 
10 substrates of the TPMT enzyme). A number of different single nucleotide polymorphisms in the 
TPMT gene are described herein mat we found to be associated to TPMT deficiencies either alobe 
or preferably in combinations (SNPs 1-41 of Table 1). 

Based on the sequence of the mutant alleles provided herein, PGR primers are constructed that are 
complementary to the region of the mutant allele encompassing the single nucleotide poly- 

15 morphism A primer consists of a consecutive sequence of polynucleotides complementary to a 
region in the allele encompassing the position which is mutated in the mutant allele but mat does 
not amplify the pseudogene. PCR primers complementary to a region in the wild-type allele 
corresponding to the mutant PGR primers are also made to serve as controls in the diagnostic 
methods of the present invention. The size of these PCR primers ranges anywhere from five bases 

20 to hundreds of bases. However, the preferred size of a primer is in -the range from 10 to 40 bases, 
most preferably from 14 to 32 bases. 

To amplify the region of fee genomic DNA of fee individual patient who may be a carrier for fee 
mutant allele, primers to one or both sides of fee targeted position, Le. fee SNPs of Table 1', are 
made' and used in a PCR amplification reaction, using known methods in fee art (e.g. 
25 Massachusetts General . Hospital & Harvard Medical School, Current Protocols In Molecular 
Biology, Chapter 15 (Green Publishing Associates and Wiley-Interscience 1991) and fee primers 
and probes of Table 2. For example for SNP1 fee primers SP900295F and SP900295R are used. 
For fee preferred protocols and methods see the Materials and Methods section and Examples. 

According to fee method of fee present invention, once an amplified specific TPMT fragment is 
30 obtained (without amplifying fee pseudogene), it can be analyzed in several ways to determine 
whether fee patient has one or more of fee here described mutant alleles of fee TPMT gene. For 
example, fee amplified fragment can be simply sequenced and its sequence compared wife fee 
wild-type cDNA sequence of TPMT. If fee amplified fragment contains one or more of fee single 
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nucleotide polymorphisms described in the present invention and/or the VNTR contains a higher- - 
number of repeats A and/or B (for example 3 or more B repeats), the patient is likely to have 
TFMT-deficiency or be a heterozygote (i.e., reduced activity) and therefore, develop hematopoietic 
toxicity when treated with standard amounts of mercaptopurine, azathioprine, or thioguanine. 
5 Alternatively, a combination of PGR fragment amplification and TaqMan or other genotyping 
analysis is used to determine TPMT genotype of the individual. 

In a preferred embodiment of the invention, a fragment of the genomic DNA of the patient is 
amplified by TaqMan (Lee et aL, Nucleic Acids Research 1993, 21: 3761-3766) analysis using the 
primers and probes of Table 2 of a respective SNP. 

10 To determine whether Hie individual is homozygous or heterozygous for TPMT, the mutation sites 
on the genomic DNA are amplified separately by using wild-type and.mutant primers. If only a 
wild-type or a mutant-type fragment is amplified, the individual is homozygous for the wild-type 
or the particular mutant-type TPMT. However, presence of more than one type of fragment 
indicates that the individual is heterozygous for TPMT allele, 

15 An example of a diagnostic assay that is carried out according to 'the present invention to 
detehnine the TPMT genotype of a person is as follows, This example is provided for illustrative 
purposes and is not meant to be limiting. 

Tissue containing DNA (e.g,, not red blood cells) from the subject is obtained. Examples of such 
tissue include white blood cells, mucosal scrapings of the lining of the mouth, epithelial cells, et 

20 cetera. Genomic DNA of the individual subject is isolated from this tissue by the known methods 
in the art, such as phenol/chloroform extraction or commercially available kits like QiaAmp™ 
DNA kits from Qiagen, Hilden, Germany, An aliquot of the genomic DNA of the subject can be 
used for PGR amplification of the TPMT gene. PCR primers encompassing the SNPs 1-50 are 
listed in Table 2 and are marked with an F or R in the ID name (forward and reverse primer) For 

25 each specific SN? one primer pair is chosen for example SNP1 can be amplified with SP900295 F 
and SP90029SR. The listed primers are examples for amplification, other primers can be designed 
by those skilled in the art. Next, the amplicons are analyzed by the various methods described 
above, which include Taqman analysis, sequencing, mutation-specific amplification, 
Pyrosequencing™, or other methods that are known to those in the art to measure genotypes. 

30 Hence, an efficient and simple method of obtaining information regarding the TPMT genotype in 
the patient is now made available which aids the physician in choosing the therapeutic modality 
for the patient 
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Definitions 

For convenience, the meaning of certain terms and phrases employed in the specification, 
examples, and appended claims are provided below. Moreover, the- definitions by itself are 
intended to explain a further background of the invention, 

5 The term "algorithm" in this file refers to a sequential analyzing of a number of SKPs in their 
respective genotypes and defines which genotype of each SHP will have- a predictive meaning for 
XPMT deficiency. For clarification an example is given for 4 SNPs; 





SNP-A 


SNP-B 


SNP-C 


SNP-D 


Polymorphism 


G/A 


G/A 


cn 


A/C 


Algotrihml 


GG 


+ G/A or AA 


+ CC 


+ CC 


Algotrihm?. 


GG 




+ CC 


+ AC 



Results: ■ Algorithml (combination of 4 SNPs) predicts i.e. reduced enzymatic activity when 
10 SNP-A is GG and SNP-B is G/A or AA and SNP-C is CC and SNP-D is CC. 

Algorithm^ (combination of 3 SNPs) predicts Le. total deficient activity when 
SNP-A is GG and SNP-C is CC and SNP-D is AC. 

Identified algorithms are called in this file haplotypes (haplotype 1 , 2 etc.) 

The term "allele", which is used interchangeably herein with "allelic variant" refers to alternative 
15 forms of a gene or portions thereof. Alleles occupy the same locus or position on homologous 
chromosomes. When a subject has- two identical alleles of a gene, the subject is said to be 
homozygous for the gene or allele. When a subject has two different alleles of a gene, the subject 
is said to be heterozygous for the gene. Alleles of a specif c gene can differ from each other in a 
single nucleotide, or several nucleotides, and can .include substitutions, deletions, and insertions of 
20 nucleotides. An allele of a gene can also be a form of a gene containing a mutation. 

The term "allelic variant of a polymorphic region of a gene" refers to a region of a gene having one 
■of -several nucleotide sequences found in that region of the gene in other individuals^ 

"Homology" or "identity" or "similarity* refers to sequence similarity between two peptides or 
between two nucleic acid molecules. Homology can be determined by comparing a position in 
25 each sequence, which may be aligned for purposes of comparison. Wben a position in the 
compared sequence is occupied by the same base or amino acid, then the molecules are 
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homologous at that position. A degree of homology between sequences is a function of the number * 
of matching or homologous positions shared by the sequences. 

The term „pseudogene" refers to sequences that have a high homology to identified genes and are 
generally untranscribed and untranslated due to non-functional promoters, missing start codons or 
5 other defects. Most Pseudogenes are intronless and represent mainly the coding sequence of the 
parent gene, For some cases it has 'been shown that in different organisms or tissues functional 
activation may occur. * 

The term "intronic sequence" or "intronic nucleotide sequence 11 refers to the nucleotide sequence 
♦ of an intron or portion thereof 

10 The term "locus" refers to a specific position in a chromosome. For example, a locus of a -gene 
refers to the chromosomal position of the gene. 

The term "molecular structure" of a gene or a portion thereof refers to the structure as defined by 
the nucleotide content (including deletions, substitutions, additions of one or more nucleotides), 
the nucleotide sequence, the state of methylation, tod/or any other modification of the gene or 
15 ' portion thereof 

The term "mutated gene" refers to an allelic form of a gene, which is capable of altering the 
" phenotype of a subject having the mutated gene relative to a subject, which does not have %> 
mutated gene. If a subj ect must be homozygous for this mutation to have an altered phenotype, the 
mutation is said to be recessive. If one copy of the mutated gene is sufficient to alter fee genotype 
20 of the subject, the mutation is said to be dominant. If a subject has one copy of the mutated gene 
and has a phenotype that is intermediate between that of a homozygous and that of a heterozygous 
(for that gene) subject, the mutation is said to be co-dominant 

As used herein, the term "nucleic acid" refers to polynucleotides such as deoxyribonucleic acid 
(DNA), and, where appropriate, ribonucleic acid (KNA). The term should also be understood to 

25 include, as equivalents, derivatives, variants and analogs of either KNA or DNA made from 
nucleotide analogs, including peptide nucleic acids (PNA), morpholino oligonucleotides (J. 
Summerton and D. Wetter, Antisense and Nucleic Acid Drug Development 7:187 (1997)) and, as 
applicable to the embodiment being described, single (sense or antisense) and double-stranded 
polynucleotides. Deoxyribonucleotides include deoxyadenosine, deoxycytidine, deoxyguanosine, 

30 and deoxythymidine. For purposes of clarity, when referring herein to a nucleotide of a nucleic 
acid, which can be DNA or an KNA, the term "adenosine", "cytidine", "guanosine", and 
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"thymidine" are used. It is understood that if the nucleic acid is KNA, a nucleotide having a uracil * 
base is uridine, , 

The term "polymorphism" refers to the coexistence of more than one form of a gene or portion 
thereof. A portion of a gene of which there are at least two different forms, Le,, two different 
5 nucleotide sequences, is referred to as a "polymorphic region of a gene 11 . A polymorphic region 
can be a single nucleotide, the identity of which differs in different alleles. A polymorphic region 
can also be several nucleotides long* 

A "polymorphic gene" refers, to a gene having at least one polymorphic region. 

To describe a "polymorphic site" in a nucleotide sequence often there is used an "ambiguity code" 
10 that stands for tie possible variations of nucleotides in one site: The list of ambiguity codes is 
summarized in the following table: 



Ambiguity Codes 
(IUPAC Nomenclature) 


. Code 


Nucleotides 


B 


c/g/t 


D 


a/gft 


H 


a/c/t 


K 


g/t 


M 


a/c 


N 


a/c/g/t 


R 


a/g 


S 


c/g 


y 


a/c/g 


w 


aft 


Y 


c/t 



For example, a "R" in a nucleotide sequence means that either an "a" or a "g?' nucleotide could be 
at that position. 

15 The terms "protein 1 ', "polypeptide" and "peptide" are used interchangeably herein when referring 
to a gene product. 
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A "regulatory element", also termed herein "regulatory sequence is intended to include elements 
which are capahle of modulating transcription from a basic promoter and include elements such as 
enhancers and silencers. The tram "enhancer", also referred to herein as "enhancer element", is 
intended to include regulatory elements capable of increasing, stimulating, or enhancing 

5 transcription from a basic promoter. The tenn "silencer", also referred to herein as "silencer 
element" is intended to include regulatory elements capable of decreasing, inhibiting, or repressing 
transcription from a basic promoter. Regulatory elements are typically present in 5' flanking 
regions of genes. However, regulatory elements have also been shown to be present in other 
regions of a gene, in particular in inixons. Thus, it is possible that genes have regulatory elements 

10 located in inixons, exons, coding regions, and 3 ! flanking sequences. Such regulatory elements are 
also intended to be encompassed by the present invention and can be identified by any of the 
' assays that can be used to identify regulatory elements in 5' flanking regions of genes. 

As used herein, the term "specifically hybridizes" or "specifically detects" refers to the ability of a 
nucleic acid molecule of the invention to hybridize to at least approximately 6, 12, 20, 30, 40, 50, 
15 60, 70, SO, 90, 100, 1 10, 120, 130 or 140 consecutive nucleotides of either strand of a gene. 

The term "wild-type allele" refers to an allele of a gene which, when present in two copies in a 
subject results in a wild-type phenotype. There can be several different wild-type alleles of a 
specific gene, since certain nucleotide changes in a gene may not affect the phenotype of a subject 
having two copies of the gene with the nucleotide changes. 

20 "Adverse drug reaction" (ADR) as used herein refers to an appreciably harmful or unpleasant 
reaction, resulting from an intervention related to the use of a medicinal product, which 

predicts hazard from future administration and warrants prevention or specific treatment, or 
alteration of the dosage regimen, or withdrawal of the product. In it*s most severe form an ADR 
might lead to the death of an individual. 

25 The term "drug response" is intended to mean any response that a patient exhibits upon drug 
' administration. Specifically drug response includes beneficial, i.e. desired drug effects, ADR or no 
detectable reaction at all. More specifically the term drug response could also- have a qualitative 
meaning, i.e. it embraces low or high beneficial effects, respectively and mild or severe ADR, 
respectively. An individual drug response includes also a good or bad metabolizing of the drug, 

30 meaning that "bad metabolizers" accumulate the drug in the body and by this could show side 
effects of the drug due to accumulative overdoses. 
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The term "haplotype" as used herein refers to a group of two or more SNPs that are functionally 
and/or spatially linked. Haplotypes of this- file are described by an algorithm. Haplotypes are 
expected to give better predictive/diagnostic information than a single SNP. 

The term "haplotype block" as used herein- refers to the observable linkage of SNPs between 
5 recombination hot spots the locations where homologous recombination between maternal and 
paternal chromosomes takes place during meiosis. Hot spots on chromosomes have distances 
between roughly 5000 to 100,000 base pairs. SNPs between hot spots are in higher linkage than 
SNPs outside the blocks. Haplotypes blocks can experimentally be identified through genotyping a 
number of neighboring SNPs on a chromosome and analyzing which SNPs are linked (have a 
10 comparable genotype pattern). 

The term "deficient TPMT activity" in a person can mean absent or very low TPMT activity or it 
can mean intermediate activity, which is between very low, and the low-end of normal TPMT 
activity. 

Diagnostic and Prognostic Assays 

15 The present invention provides methods for detennining the molecular structure of at least one 
polymorphic region of a gene, specific allelic variants and haplotypes of said polymorphic region 
being associated with TPMT deficiencies. In one embodiment, determining the molecular structure 
of a polymorphic region of a gene comprises determining the identity of the allelic variant 'A 
polymorphic region of a gene, of which specific alleles are associated with TPMT deficiencies can 

20 be located in an exon, an intron, at an intron/exon border, or in the promoter or other 5' or 3' 
flanking regions of the coding sequence of the gene. 

hi case of analyzing TPMT gene polymorphisms a TPMT gene-specific amplification is recom- 
mended to omit interference of sequences fiom the TPMT pseudogene as discussed above. 

The invention provides methods for determining whelher a subject has a functional defect in 
25 metabolising thiopurines or structural analogues that are metabolized by TPMT. 

In preferred embodiments, the methods of the invention can be characterized as comprising 
detecting, in a sample of cells fiom the subject, the presence or absence of specific allelic variants 
of one or more polymorphic regions of a gene. The allelic differences can be: (i) a difference in the 
identity of at least one nucleotide or (if) a difference in the number of nucleotides, which 
30 difference can be a single nucleotide or several nucleotides. 
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Due to the presence of a TPMT pseudogene in fee human genome, which is highly homologuesto • 
fee exons of fee TPMT gene most detection methods, need first to amplify at least a portion of a 
gene prior to identifying fee allelic variant An example is given in fee following: Primers for 
gene-specific amplification have to be located in sequences on fee gene of interest feat show no 
5 homology to fee pseudogene, for example fee intron sequences of fee gene of interest or other 
sequences feat are unique to fee gene of interest. Those skilled in fee art find those unique 
sequences through pairwise alignment of homologous sequences of fee gene of interest wife fee 
help of bioinfotmatics tools like MegAlign™ (DNA Star) or ClustalW™ from fee Wisconsin 
Genetics Computer Group or ofcer programs. ' Amplification of fee gene fragments can be 
10 performed, e.g., by PCR and/or by Iigase chain reaction (LCR), according to methods known in fee 
art. In one embodiment, genomic DNA of a cell is exposed to two PCR primers and amplification 
for a number of cycles sufficient to produce fee required amount of amplified DNA. ha preferred 
embodiments, fee primers are located between 40 and 350 base pairs apart. Preferred primers for 
amplifying gene fragments of genes offers file are listed in Table 2 in fee Examples. . 

15 A preferred detection method is allele specific hybridization using probes overlapping fee 
polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around fee polymorphic region. 
Examples of probes for detecting specific allelic variants of fee polymorphic region are probes 
comprising a nucleotide sequence set forth in any of SNPs 1-41 in Table 1. In a preferred 
embodiment of fee invention, several probes capable of hybridizing specifically to allelic variants 
20 are attached to a solid phase support, e.g., a "chip". Oligonucleotides can be bound to a solid 
support hy a variety of processes, including lithography. For example a chip can hold up to 
250,000 oligonucleotides (GeneChip, Affymetrix). Mutation detection analysis using these chips 
comprising oligonucleotides, also termed "DNA probe arrays" is described e.g., in Cronin et al. . 
. (1996) Human Mutation 7:244 and in Kozal et al. (1996) Natare Medicine 2:753. In one 
25 embodiment, a chip comprises all fee allelic variants of at. least one polymorphic region of a gene. 
The solid phase support is then contacted wife a test nucleic acid and hybridization to fee specific 
probes is detected. Accordingly, fee identity of numerous allelic variants of one or more genes can 
be identified in a simple hybridization experiment For example, fee identity of fee allelic variant 
of fee nucleotide polymorphism of nucleotide G or A at position 16 of SNP1 in Table 1 and feat of 
30 other possible polymorphic regions can be determined in a single hybridization experiment. In case 
of TPMT gene analysis prior to hybridization experiments a gene-specific amplification is needed 
to get rid of fee pseudogene sequences which would interfere in hybridization experiments. 

Alternative amplification methods include: self sustained sequence replication (Guatelli, J, C. et 
al., 1990, Proc. Natl. Acad. Sci. U.S.A. 87:1874-1878), transcriptional amplification system 
35 (Kwoh,D. Y. etal, 1989, Proc. Nafe Acad. Sci. U.S.A. 86:1173-1177), Q-Beta Replicase (Lizardi, 
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P. M. et al./ 1988, Bio/Technology 6:1197), or any oUier nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of skill 
in-fce art. These detection schemes are especially useful for the detection of nucleic acid molecules 
if such molecules are present in very low numbers. 

5 In one embodiment, any of a variety of sequencing reactions known in the art can be used to 
directly sequence at least a portion of a gene and detect allelic variants, e.g., mutations, by 
comparing the sequence of the sample sequence with me corresponding wild-type (control) 
sequence. Exemplary sequencing reactions include mose based on techniques developed by 
• Maxam and Gilbert (Proc. Nati Acad Sci USA (1977) 74:560) or Sanger (Sanger et al (1977) Proc. 
10 Nat. Acad. Sci 74:5463). It is also contemplated that any of a variety of automated sequencing 
' procedures may be utilised when performing the subject assays (Biotechniques (1995) 19:448), 
including sequencing by mass spectrometry (see, for example, U.S. Pat No. 5,547,835 and 
international patent application Publication Number WO 94/16101, entitled DNA Sequencing by 
Mass Spectrometry by H. Koster; U.S. Pat No. 5,547,835 and international patent application 
15 Publication Number WO 94^1822 entitled "DNA Sequencing by Mass Spectrometry Via 
Exonuclease Degradation" by H. Koster), and U.S. Pat. No. 5,605,798 and International Patent 
Application No. PCT/US96/03651 entitled DNA Diagnostics Based on Mass Spectrometry by H. 
Koster; Cohen et al. (1996) Adv Chromatogr 36:127-162; and Griffin et al. (1993) Appl Biochem 
Bioteclmol 38:147-159). It will be evident to one skilled in the art that for certain embodiments, 
20 me occurrence of only one, two or three of me nucleic acid bases need be determined in the 
sequencing reaction. For instance, A4rack or the like, e.g., where only one nucleotide is detected, 
can be carried out. 

Yet other sequencing methods are disclosed, e.g., in U.S. Pat No. 5,580,732 entitled "Method of 
DNA sequencing employing a mixed DNA-polymer chain probe" and U.S. Pat. No. 5,571,676 
25 entitled "Method for mismatch-directed in vitro DNA sequencing". 

In some cases, the presence of a specific allele of a gene in DNA from a subject can be shown by 
•restriction enzyme analysis. For example, a specific nucleotide polymorphism can result in a 
nucleotide sequence comprising a restriction site which is absent from the nucleotide sequence of 
another allelic variant 

30 In other embodiments, alterations in electrophoretic mobility are used to identify the type of gene 
allelic variant. For example, single strand conformation polymorphism (SSCF) may be used to 
detect differences in electrophoretic mobility between mutant and wild type nucleic acids (Orita et 
al. (1989) Proc Natl. Acad. Sci USA 86:2766, see also Cotton (1993) Mutat Res 285:125-144; and 
Hayashi (1992) Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and 
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control nucleic adds are denatured and allowed to renature. The secondary structure of single- • 
stranded nucleic acids varies according to sequence, the resulting alteration in electxophoretic 
mobility enables fee detection of even a single base change. The DNA fragments may be labeled or 
detected with labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather 
5 than DNA), in which the secondary structure is more sensitive to a change in sequence, hi another 
preferred embodiment, the subject method utilizes heteroduplex analysis to separate double 
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen at al. 
(1991) Trends Genet 7:5). 

m yet another embodiment, the identity of an allelic variant of a polymorphic region is obtained by 
10 analyzing the movement of a nucleic'acid comprising fee polymorphic region in poiyacrylamide 
gels containing a gradient of denaturant is assayed using denaturing gradient gel electrophoresis 
(DGGE) (Myers et al (1985) Nature 313:495). When DGGE is used as the method of analysis, 
DNA will be modified to insure that it does not completely denature, for example by adding a GC 
clamp of approximately 40 bp of high-melting GC-rich DNA by PCR In a mrther embodiment, a 
15 temperature gradient is used in place of a denaturing agent gradient to identify differences-in me 
■ mobility of control and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem265: 1275). 

Examples of techniques for detecting differences of at least one nucleotide between 2 nucleic acids 
include, but are not limited to, selective oligonucleotide hybridization, selective amplification, or 
selective primer extension. For example, oligonucleotide probes may be prepared in which the 
20 known polymorphic nucleotide is placed centrally (allele-specific probes) and men hybridized to 
target DNA under conditions which permit hybridization only if a perfect match is found (Saiki et 
al. (1986) Nature 324:163); Saiki et al (1989) Proc. Natl Acad. Sci USA 86:6230; and Wallace et 
al. (1979) Nucl. Acids Res. 6:3543). Such allele specific oligonucleotide hybridization techniques 
may be used for the simultaneous detection of several nucleotide changes in different polymorphic 
. 25 regions of gene. For example, oligonucleotides having nucleotide , sequences of specific allelic 
variants are attached to a hybridizing membrane and this membrane is then hybridized with labeled 
sample nucleic acid. Analysis of the hybridization signal will then reveal the identity of the 
nucleotides of the sample nucleic acid. 

Alternatively, allele specific amplification technology which depends on selective PCR ampli- 
30 fication may be used. Oligonucleotides used as primers for specific- amplification may carry the 
allelic variant of interest in the center of the molecule (so that amphfication depends on 
differential hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 
y end of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 11:238; Newton et al. (1989) Nucl. Acids Res. 
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17:2503). This technique is also termed "PROBE" for Probe OHgo Base Extension. In addition it ■ 
may be. desirable to introduce a novel restriction site in fee region of fee mutation to create 
cleavage-based detection (Gasparini et al (1992) Mol. Cell Probes 6:1). 

In another embodiment, identification of fee allelic variant is carried out using an oligonucleotide 
5 ligation assay (OLA), as described, e.g., in U.S. Pat No. 4,998,617 and in Landegren, U. et al., 
Science 241:10774080 (1988). The OLA protocol uses two oligonucleotides which are designed 
to be capable of hybridizing to abutting sequences of a single strand of a target. One of fee 
oligonucleotides is linked to a separation marker, e.g.,. biotinylated, and fee ofeer is. detectably 
labeled. If fee precise complementary sequence is found in a target molecule, fee oligonucleotides 
10 will hybridize such feat their teunini abut, and create a ligation substrate. Ligation then permits fee 
labeled oligonucleotide to be recovered using avidin, or another biotin ligand. Nickerson, D. A. et 
aL have described a nucleic acid detection assay feat combines attributes of PGR and OLA 
(Nickerson, D. A. et al., Proc. Natl. Acad. Sci. (U.S.A.) 87:8923-8927 (1990). In feis method, PCR 
is used to achieve the exponential amplification of target DNA which is then detected using OLA 

15 Several techniques based on feis OLA method have been developed and can be used to detect 
specific allelic variants of a polymorphic region of a gene. For example, U.S. Pat. No. 5,593,826 
. discloses an OLA using an oligonucleotide having 3'-amino group and a 5'-phosphorylated 
oligonucleotide to form a conjugate having a phosphoramidate linkage. In another variation of 
OLA described in Tobe et al. ((1996)Nucleic Acids Res 24: 3728), OLA combined wife PCR 

20 permits typing of two alleles in a single microtiter well By marking each of fee allele-specific 
primers wife a unique hapten, ie. digoxigenin and fluorescein, each LA reaction can be detected 
by using hapten specific antibodies feat are labeled wife different enzyme reporters, alkaline 
phosphatase or horseradish peroxidase. This systempermits fee detection of fee two alleles using a 
' . high throughput format feat leads to fee production of two different colors. 

25 The invention further provides methods for detecting single nucleotide polymorphisms in a gene. ' 
Because single nucleotide polymorphisms constitute sites of variation flanked by regions of 
invariant sequence, feeir analysis requires no more than fee determination of fee identity of fee 
single nucleotide present at fee site of variation and it is unnecessary to determine a complete gene 
sequence for each patient. Several methods have been developed to fecilitate the analysis of such 

30 single nucleotide polymorphisms. 

In one embodiment, fee single base polymorphism can be detected by using a specialized 
exonuclease-resistant nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S. Pat. No. 4,656,127). 
According to fee mefeod, a primer complementary to fee allelic sequence immediately 3' to fee 
polymorphic site is permitted to hybridize to a target molecule obtained from a particular animal or 
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human. If fee polymorphic site on the target molecule contains a nucleotide that is complementary ■ 
to me particular exonuclease-xesistant nucleotide derivative present, then that derivative will be 
incorporated onto the end of me hybridized primer. Such incorporation renders the primer resistant 
to exonuclease, and thereby permits its detection. Since the identity of the exonuclease-resistant 
5 derivative of the sample is town/a finding that the primer has become resistant to exonucleases 
reveals that the nucleotide present in the polymorphic site of the target molecule was 
complementary to that of the nucleotide derivative used in the reaction. This method has the 
advantage that it does not require the determination of large amounts of extraneous sequence data. 

In another embodiment of the invention, a solution-based method is used for determining the 
10 identity of the nucleotide of a polymorphic site. Cohen, D. et al. (French Patent 2,650,840; PCT 
Appln. No. WO91/02087). As in the Mundy. method of U.S. Pat No. 4,656,127, a primer is 
employed that is complementary to allelic sequences immediately 3' to a polymorphic site. The 
method determines the identity of the nucleotide of that site using labeled dideoxynucleotide 
derivatives, which, if complementary to. the nucleotide of the polymorphic site will become 
15 incorporated onto the terminus of the primer. 

An alternative method, known as Genetic Bit Analysis or GBA TM is described by Goelet, P. et al. 
(PCT Appln. No. 92/15712). The method of Goelet P. et al. uses mixtures of labeled terminators 
and a primer that is complementary to the sequence 3' to a polymorphic site. The labeled 
. terminator that is incorporated is thus determined by ; and complementary to, the nucleotide present 
20 in fee polymorphic site of the target molecule being evaluated. In contrast to fee method of Cohen 
et al. (French Patent 2,650,840; PCT Appln. No. WO91/02087) fee method of Goelet, P. et al. is 
preferably a heterogeneous phase assay, in which fee primer or fee target molecule is immobilized 
to a solid phase. 

Recently, several primer-guided nucleotide incorporation procedures for assaying polymorphic 
25 sites in DNA have been described (Romher, J. S. et al., Nucl. Acids. Res. 17:7779-7784 (1989); 
Sokolov, B. P., Nucl. Acids Res. 18:3671 (1990); Syvanen, A. -C, et al., Genomics 8:684-692 
(1990) Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. (Q.S.A.) 88:1143-1147 (1991); Prezant 
T. R et al., Hum. Mutat 1:159-164 (1992); Ugozzoli, L. et al., GATA 9:107-112 (1992); Nyren, P. 
et al., Anal Biochem. 208:171-175 (1993)). These methods differ from GBA TM in feat they all 
30 rely on fee incorporation of labeled deoxynucleotides to fescriminate between bases at a 
polymorphic site. In such a format, since fee signal is proportional to fee number of deoxy- 
nucleotides incorporated, polymorphisms feat occur in runs of fee same nucleotide can result in 

signals feat are proportional to fee length of fee run (Syvanen, A.-G, et al., Amer. J. Hum. Genet 

52:46-59 (1993)). 
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For detemaning fee identity of the allelic variant of a polymorphic region located in fee coding • 
region of a gene, yet other methods than those described above can be used. For example, using an ■ 
antibody specifically recognizing fee mutant F°*ein in, e.g., immunohistochemistry or hnmuno- 
precipitaiion can perform identification of an allelic variant, which encodes a mutated gene 

5 protein. Antibodies to wild-type gene protein are described, e.g., in Acton et al. (1999) Science 
271:518 (anti-mouse gene antibody cross*eactive wife human gene). Other antibodies to wild-type' 
gene or mutated forms of gene proteins can be prepared according to methods known in fee art. 
Alternatively, one can also measure an activity of a gene protein, such as binding to a lipid or 
lipoprotein. Binding assays are known in fee art and involve, e.g., obtaining cells from a subject, 

10 and performing binding experiments wife a labeled lipid, to determine whether binding to fee 
mutated form of fee receptor differs from binding to fee wild-type of fee receptor. 

If a polymorphic region is located in an exon, either in a coding or non-coding region of fee gene, 
the identity of fee allelic variant can be determined by determining fee molecular structure of fee 
mKNA, pre-mKNA, or cDNA. The molecular structure can be determined using any of the above 
15 described methods for determining fee molecular structure of fee genomic DNA, e.g., sequencing 
and SSCP. 

The methods described herein may be performed, for example, by utilizing pre-packaged 
diagnostic kits, such as those described above, comprising at least one probe or primer nucleic acid 
described herein, which may be conveniently used, e.g., to determine whether a subject is at risk <rf 
20 having TPMT deficiencies which can cause severe side effects when treated wife feiopurmes or 
analogues. 

Sample nucleic acid for using in fee above-described diagnostic and prognostic methods can be 
obtained from any cell type or tissue of a subject. For example, a subject's bodily fluid (e.g. blood 
or saliva) can be obtained by known techniques (e.g. venipuncture or swab, respectively) or from 
25 human tissues like heart (biopsies, transplanted organs). Alternatively, nucleic acid tests can be 
performed on dry samples (e.g. hair or skin). 

Diagnostic procedures may also be performed in situ directly upon tissue sections (fixed and/or 
' frozen) of patient tissue obtained from biopsies or resections, such feat no nucleic acid purification 
is necessary; Nucleic acid reagents may be used as probes and/or primers for such in situ 
30 procedures (see, for example, Nuovo, G. J., 1992, PCR in situ hybridization: protocols and 
applications, Raven Press, New York). 
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fc'additioa to methods which focus primarily on fhe'detection of one nucleic acid sequence, ■ 
profiles may also be assessed in such detection scheme, Fingerprint profiles may be generated, for 
example, by utilizing a differential display procedure, Northern analysis and/or RT-PCR. 

In practicing die present invention, the distribution of polymorphic patterns in a large number of 
individuals exhibiting particular mates for tbiopurme response is determined by any of the 
methods described above, and compared with the distribution of polymorphic patterns in patients 
that have been matched for age, ethnic origin, and/or any other statistically or medically relevant 
parameters, who exhibit quantitatively or qualitatively different status markers. Correlanons are 
achieved using any method known in me art, including nominal logistic regression, chi square tests 
or standard least squares regression analysis, ha mis manner, it is possible to establish statistically 
significant correlations between particular polymorphic patterns and particular thiopurine response 
statuses (given in p values). It is former possible to establish statistically significant correlate 
between particular polymorphic patterns and changes in drug response such as, would result, e.g., 
from particular treatment regimen, In Ms manner, it is possible to correlate polymorphic patterns 
15 withrespoBsivity to particular treatments, 

• m another embodiment of ^present invention two or more polymorphic regions are combined to 
define so called 'haplotypes'. Haplotypes are groups of two or more SNPs mat are functionally 
and/or spatially linked. It is possible to combine SNPs that are disclosed in the present invennon 
either with each other or with additional polymorphic regions to form a haplofype. Haplotypes aae 
20 expected to give better predictive/diagnostic information than a single SNP. ' 

m a preferred embodiment of the present invention a panel of SNPs/haplotypes is defined that 
predicts drug response. This predictive panel is then used for genotyping of patients on a platform 
that can genotype multiple SNPs at the same time (Multiplexing). Preferred platforms are e.g. gene 
chips (Affymetrix) or the Luminex LabMAP™ reader. But also newer developments are under 
way like planar waveguides or nanoparticles that could be used for multiplex genotyping. Thin 
film planar waveguides (PWGs) as used by Zeptosens, Witterswil, Switzerland, for example- 
consist of a 150 to 300 nm thin film of a material with high refractive index (e.g. Ta 2 0 5 or TK*>, 
which is deposited on a transparent support with lower refractive index (e.g. glass or polymer). A 
parallel laser light beam is coupled into the waveguiding film by a diffractive grating that is etched 
or embossed into the substrate. The light propagates within this film and creates a strong 
evanescent field perpendicular to the direction of propagation into the adjacent medium. The field 
strength decays exponentially with the distance from the waveguide surface, and its penetrauon 
depth is limited to about 400 nm. This effect can be utilized to selectively excite only fluorophores 
located at or near the surface of the waveguide. 
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For diagnostics applications, specific captures are immobilized on fee waveguide surface. The 
• presence of the analyte in a sample applied to a PWG chip is detected using fluorescent reporter 

molecules attached to the analyte or one of its binding partners in the assay. Upon fluorescence 
' excitation by fee evanescent field, excitation and detection of fluorophores is restricted to fee 
5 sensing surface, whilst signals from unbound molecules in fee bulk solution are not detected. 
' Using feiB technology it is possible to detect polymorphisms in fee TPMT gene but one has to be 

careful in designing fee capture probes in respect to fee pseudogene (see discussion above on 

identifying non-homologous sequences between gene and pseudogene. 

Alternatively, nanoparticles could .be used feat emit different fluorescent colors so that a 
10 multiplexing can be set-up for several SNP assays in one reaction as discussed for example in 
(Expert Rev MolDiagn. 2003; 3(2): 153-61). 

The subsequent identification and evaluation of a patient's haplotype can then help to guide 
specific and individualized therapy. 

For example fee present invention can identify patients exhibiting genetic polymorphisms or 
15 haplotypes which indicate an increased risk for adverse drug reactions. In that, case fee drug dose 
should be lowered in a way feat fee risk for ADR is 'diminished. 

It is self evident feat fee ability to predict a patient's individual drug response should affect fee 
formulation of a drug, i.e. drug formulations should be tailored in a way feat feey suit fee different 
• patient classes (low/high responder, poor/good metabolizer, and ADR prone patients). Those 
20 different drug formulations may encompass different doses of the drug, i.e. fee medicinal products 
contains low or high amounts of fee active substance, hi another embodiment of fee invention fee 
drug formulation may contain additional substances feat facilitate fee beneficial effects and/or 
diminish fee risk for ADR (Folkers et al. 1991, US Pat. 5,316,765). 

TsolBtedPolvmon f>h fc Nttd rf '' Adrts - andprobes 
25 The present invention provides isolated nucleic acids comprising fee polymorphic positions 
described herein for human genes. The invention also provides probes, which are useful for 
detecting these polymorphisms. 

in practicing fee present invention, many conventional techniques in molecular biology. Such 
techniques are well known and are explained felly in, for example, Sambrook et al, 2000, 
30 Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York; DNA Cloning: A Practical Approach, Volumes I and H, 1985 (D. 
N. Glover ed.); Oligonucleotide Synthesis, 1984, (M. L.Gait ed.); Nucleic Acid Hybridization, 
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1985, (Hames and Higgins); Ausubel et ah, Current Protocols in Molecular Biology, 1997, (John ■ 
Wiley and Sons); and Methods in Ecology Vol. 154 and Vol 155 (Wo and Grossman, and 
Wu, eds., respectively). 

The nucleic acids of the present invention find use as probes for the detection of genetic 
5 polymorphisms and as templates for the recombinant production of normal or variant peptides or 
polypeptides encoded by genes listed in the Examples. 

Probes in accordance with the present invention comprise wifoout limitation isolated nucleic acids 
of about 10-100 bp, preferably 14-75 bp and most preferably 15-25 bp in length, which hybridize 
at high stringency to one or more of the polymorphic sequences disclosed herein or to a sequence 
10 immediately adjacent to a polymorphic position. Furthermore, in some embodiments a full-length 
gene sequence may be used as a pn>be. In one series of embodiments, the probes span the 
polymorphic positions in genes disclosed herein. In another series of embodiments, the probes 
correspond to sequences immediately adjacent to the polymorphic positions. 

Kits 

15 As set forth herein, the invention provides diagnostic methods, e.g., for determining the identity of 
the allelic 'variants of polymorphic regions present in foe gene loci of genes disclosed herein, 
wherein specific allelic variants of the polymorphic region are associated with TPMT deficiencies. 
In a preferred embodiment, the diagnostic kit can be used to determine whether a subject is at risk 
suffering severe side effects when treated with thiopurines. This information could then be used, 

20 e.g,, to optimize treatment of such individuals. 

3b preferred embodiments, the kit comprises probe and primers, which are capable of hybridizing 
to a gene and thereby identifying whether foe gene contains an allelic variant of a polymorphic 
region which is associated with TPMT deficiencies. The kit further comprises an algorithm that 
identifies from a combination of SNPs foe grade of TPMT deficiency. The kit preferably further 
25. comprises instructions for use in diagnosing a subject having TPMT deficiencies. The probe or 
primers.of foe kit can be any of foe probes or primers described in this file. 

Preferred kits for amplifying a region of a gene comprising a polymorphic region of interest 
comprise one, two or more primers. 
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Material and Methods 

Genotyping using the' ABI 7700/7900™ instrument (for TaqMan analysis) . 

Genotyping of patient DNA using the TaqMan (Applied BiosysWPerkin Elmer) was performed 
according to the manufacturer's instructions. The TaqMan assay is discussed by Lee et al., Nucleic 
Acids Research 1993, 21:- 3761-37.66. 

Human Subjects 

Whole blood was obtained from a central diagnostic lab as waste material, individual labels on the 
tubs were removed irreversibly and replaced by a new number. DNA was isolated with 
commercially available kits (QIAamp DNA Blood Mini Kit) from Qiagen, Hilden, Germany . 
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Examples 

The Mowing examples are intended to further illustrate certain preferred embodiments of fee 
invention and are not intended to be-limiting in nature. 

Genotyping Assays 

DNA of ca. 1300 anonymyzed blood samples was genbtyped for 50 SNPs that are listed in Table 1. 
The sequence of each SNP is given in the table wift each SNP in the middle of foe sequence. The 
position is also given in numbers where'foe SNP can be found on foe sequence. As a reference foe 
' TPMT gene sequence was taken from foe NCBI, accession number AL589723, foe sequence was 
reversed and complemented as foe coding sequence for foe TPMT enzyme is not given hi 5' - 3' 
direction. For all 50 SNPs a TaqMan assay was designed with PGR primers and TaqMan probes 
for each allele with different dyes as listed in Table 2. The general protocol for using TaqMan is 
mentioned above. One example protocol is given in Table 3 for describing concentrations of 
primers and probes, DNA, and other parameters like cycle temperatures and times. 

Table 1: SNP sequences of SNPs 1-50 including surrounding sequence plus respective position in 
Accession number AL589723 (reverse complement). Nine reference SNPs were included for 
benchmarking. SNPs 44, 47, and 50 have proven genotype to phenolype correlation (see above). 
SNPs 42, 43, 45, 46, 48, and 49 are from a recent patent appHcation,-WO 03/066892 Al. 
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rAGGATGAC 












A16C i 


VATAAATAAATAAATMAATC 
rAGGTTTCCAA 


SEQ-lu J 








-R ^931660 


T16A 


rGCACATTTAATTCTWCACA 
rTTTTGTGTCT 


SEQ-1 1 




yuuzy? 






T16A !' 


rGCTGAGTAAAGTGGWTGTT 
A.GAGACATTCC 


PEA "S O 

SEQ-I2 


bJMr iZ 




P /V? I j 




C16T 


rCTCAGGTTTACn:CYGAGG 
CTTGAGTACAC 


SEQ-13 


bNr Id 




XI*) \{\ 


r«? 251R472 


A16T 


TAATAAAGAATTTTCW AAAC 
ATCCCCAAGAA 


3EQ-14 


CVxTD 1 /I 

bJNr 14 






rs 3928922 


T16A 


AGTGTTCACCTACCAWACAA 
TTGTCCTAAAA 


CCA 1 < 


£>JNJr 1 2 








■ T16C 


TCCTCTTCAGGCTATYAAAG 
AAGC ATTT AG 




biNr jld 






t« 4449636 


C16T • 


AACAGAATTATCTTGYCTTA 
ATGATGAATTC 


SEQ-17 


bNxrX f 




^76Afi 




G16T 


AAACTCCATTTTCAGKAAAT 
ACACAGAAAT 


SEQ-18 




onn^ 1 7 
1 / 




r« ^RQR1^7 
is joyoij / 


C16T 


TTCCCTTTTACATTTYCTGGA 
TCCTTGTATG 


SEQ-19 








tr74S4407 


G16A 


GTAATTCTCTACAAARAGAA 
TTCACTTTAAC 




oINrZU 








T16A 


ATTTTAGGAAGGCACWTGTT 
ACATTATAGCA 


SEQ-21 










G16T 


GAACTTGGGATACAAKAATT 
TTTTACAGAG 




CKTD70 








C16T 


AGAAGAACCAATCACYGAA 
ATTCCTGGAAC 


SEQ-23 


SNP23 


900277 


41835 


1^2518463 


T16C 


AAAAGTTTTTCTCAGYGTGA 
GTATTATGAGG 


SEQ-24 


SKP24 


900340 


44295 




C13A 


GGGCCCTGGCATMAGTACTG 
TTT 


SEQ-25 


SNP25 


900303 


45354 


rs 2842936 


Al 6G 


TAGCAGAGTAAAAATRTCAC 
TCTGCTCGAGG 
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SEQ-ID Si 


NT-H) B 


ay-SNP F 


osition rs 


#{NCBI) IS 


NPPos.iaSeq, 


Sequence around SNP 


SBQ-26~i 


NP26 


900278 


45429 rs 


284293 


A T6G C 
T 


CAACTGATCnTCAARGTTG 
CCTCTGTGAT 


SBQ-27 S 


NP27 


900319 


46390 ri 


j 2842934 


PICT > A 
C 


.GCATTAGTTGCCATYAATC 
JAGGTGATCGC 


SEQ-28 S 


isrp28 


900311 


46777 r 


3 2859778 


T16G 1 

c 


X3GTCAC1TGCCTATKCCAG 
tTATTGTTCAA 


SEQ-29 S 


JNP29 


900312 


47890 r 


s 4712327 


T16P 

i X ov^ J 


rATAGCATGGAAATAYTGAA 
[TACTTAGTTG 


SBQ-30 ! 


SNP30 


900292 


48260 i 


s 2842955 


A 1 £P > 

.j 


UlCAGGTTAGGCTCCMCATC 
4GTGAAATAAG 


SEQ-31 


SNP31 


900313 


48568 : 


:s 2842952 




"TTTITnTCGAGACRGAGT 
rTCGCTCTTGT 


SEQ-32 


3NP32 


900304 


49788 


rs 2518467 


r^i &p 


CGTGOCCAGC5CTTATSTTAG 
TATTTTATATA 


SEQ-33 


SKP33 


-900305 


49921 


rs 2842951 




CTCCTTAGATTGTAC&TTGTC 
AAGTACTGAT 


SEQ-34 


SNF34 


900293 


5042$ 


r"ift A ^r\ci\ 

rs 2842950 




GTCTAGCCAGGCTCCRTAGA 
AACTGGA0TGC | 


SBQ-35 


SNP35 


900324 


51526 


rs 6921269 




GGGAAAGAAGTTTCAKTATC 
TCCTGTGTGTT 


SEQ-36 


SNP36 


900280 


52782 


rs 2842947- 




CTGGAGGTGGAGTCTRAGGA 
TACTGCTCTTA 


SEQ-37 


SNP37 


' 900281 


54592 


rs 1800584 


rti &a 


CTCTTTCTTGTTTCARGTAAA 
ATATGCAATA 


SEQ-38 


SNP38 


900332 


54648 






TTTTGAAGAACGACAZAAAA 
GTTGGGGAAT 


SEQ-39 


SNP39 


900283 


55383 


rs 1802650 




GGCCTGACATTCTTTWTGAA 
ATTTAGAATTG 


SEQ-40 


SNF40 


900284 


56323 


rs 2842944 


C16G 


GGTCTCACTTTGTTGSCCAC 
GCTGATGTTGA 


SBQ-41 


3HP41 


900285 


56945 


rs 7886 


' A16T 


CTTAGGTAGTTGATCWTTTA 
TGTAATATGTG 












3EQ-42 |£KP42 


| 900326 


j 36369 




j C11G 


TGCTTTTCATSAGGAACAAGG 
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SEQ-H> I 


5NF-IB 3 








5MPPos.inSeq. 


Sequence around SNP 


SEQ-43 




o run 9 7 


37528 T 




Gil A 


rocrcrnocRGAAAAGoaGT 


SEQ-44 


SNP44 


900276 


41649 : 


cs 1800462 


G16C 


&TTTTATGCAGGTTTSCAGA 
CCGGGGACACA 


SEQ-45 


SNP45 


900328 


41767 




A11C 


CCTGGAACCAMAGTATTTAA 
GG 


SEQ-46 


SNP46 


900329 


45oo4 




Gil A 


TCATTGTACTRTTGCAGTATT 


SBQ-47 


SNP47 


900279 


46376 


rs 1800460 


G16A 


ATTTGGGATAGAGGARCATT 
AGTTGCCATTA 


SEQ-48 


SNP48 


900330 


46404 




G11A 


CCAGGTGATCRCAAATGGTA 
A 


SEQ-49 


SNP49 


900331 


54679 




AUG 


" TCTTTTTGAARAGTTATATCT 


5EQ-50 


SNP50 


900282 


54686 


is 1142345 


A16G 


TnTTGAAAAGTTATR.TCTA 
CTTACAGAAAA 



10 



Table 2: Primer and probe sequences of SNPs 1- 50. The first column describes to SNP-ID for 
SNPs 1-50, to second column describes to primer and probe 3D of each SNP. The nomenclature 
of primers and probes is as follows: SP900xxx stands for to respective SNP (for example 
SP900295 is SNP 1) followed by one or more alphabetic letters: F andR describe to forward and 
reverse primers, or one of to four base symbols A, C, G, T followed by a *P or «-« describe to 
probes- to 5' dye type of to probes are symbolized by FAM, VIC. Tet «+»' probes have'a 
MGE/DarlsQuencher at 3' end, probes use TAMRA as Quencher. "Out" at to end of to 
primer's name stands for first primers in nested PGR or outer primer, «AoD» stands for Assays- 
on-Demand™ from Applied Biosystems (commercially available assays). 



SEO-IB SNP-ID 3 


Primer/Probe-ID 


Sequence 


bp f 


Tm°C 


SEQjl^ 




SP9^295^am ' 


PATTITTTCTcCTCCTTGCAT 


21 


66,4 


SBQJl 


SNP1 


SP900295F 


TTCTCCAACCTGTTAGCAATCCTA 


24 




SEQ^53 


SNP1 


SP900295R . 


GTGAAAGTGAATTATATGGATGATGGTAA 


29 




SEQJ4 


SKPI 


SP900295T+Vic 


AGTATTTTTTCTCTCCTTGCA 


22 


66,4 


SEQJ5 


SNP2 


SP900294A+Faxa 


TTTCTCTTTCAaATAACACTAT 


22 


65,5 


SEQ_56 


SNP2 


SP900294F 


CAACATAGCAACACCCTGTATCAAG 


25 




SEQJ7 


SN?2 


SP900294G+Vic 


TCXCTTTCAgATAACACTAT 


20 


65,2 


3BQJS8 


SHP2 


SP900294R 


CCCATAAAACAGGCTGTCAGAAG 


23 




SEQJ9 


SNP3 


SP900296F 


CTGGCCCTCTTTGTGTTTAAAAA 


23 




SECL60 


SNP3 


3P900296G+Fam 


TCTCGATgTTATTGAAC 


17 


65,9 


SBQJ1 


SISfFS 


SP900296R 


' CAGAGGAAAATATTCAATTAAGGGTTAAG 


1 
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SEQ-ED S3 


VTT> TT\ "D 

\Jr~JLLJ Xj 


»*i m at /P irnVi p^TTl 
aAAAOG-^TU-V-ipi T 


Sequence , 


bp T 


ra°C 1" 


SEQJ2 S s 


NP3 Sj 


^TCTCGATtTTATTGAAC ' 


20 


66,2 


s 


SIP4 A 




1916835_10 






SEQ_63 S 


NP5 S 


iDonnoQ*? A "Ram A 


.GCCCGCACaGCCTTGCAG 


19 


65,6 


SEQ_64 S 


NP5 b 




CTTCCCGGCCGATAGG 


17 




SBQ_65 S 


RP5 b 


TXlAAOGTrr Tfrt C 


'CCGCACeGCCTTGCAG 


17 


65,8 


SECL.66 S 


MP 5 S 




jCTGTGCCAGAGAATTACTACAACA 


25 




8EQ_67 £ 


>NP6 E 


>P9 00/ /3v>r am i 


"AGCCCGGGAATTcCCCCTTC 


21 


65,9 


SBQL68 £ 


JNP6 £ 


>P900273r2 v. 


A A C ATCGCGACGAA 


17 




SEQ_69 * 


>NP6 E 


sP9002731U i 


i T A rCTCCTGCCCCGGATTA 


20 




3EQJ70 £ 


3NP6 I 


SP90Uj£/5i-I£t r 


"AGncCGGGAATTtCCCCTTCTT 


23 


65,5 


SBQJ71 ! 


5NP7 i 


-iTkAAAOAC 1 A _l-T?*»"rrt 

3P9 OUzy SAtr am j 


ATTTCACAAaAATTTCT 


17 


66 s 6 


SEQJ72 1 


3NP7 


bP90t).iyor 


1CACATTACAAGAATTAAGGAAGGG 


25 




SEQJ73 


SGSJP7 


CTDAAAOOCU - ' 

SPyUUZyoK. 


TTGAGGACTITGTTTGTGGGC 


21 




3BQJ74 


SNP7 


5>py w/y b 1 *t- v ic 


AAATTTCACAATAATTTCT 


19 


66,5 4 


SEQJ75 


SNP8 


OTJAAft"! "2 0 A 4-T7qtvi 


TCCTACAGCTaACTGAATA 


19 


66,4 


SE<^76 


SNPS 


SP90(J33ar 


rATGGGTACTTTCCTCCTTTCATAA 


25 




SBQJ77 


SNP8 


oP9Wj^oK- ! 


TGAGGAAGGTGGCCAAATATACA 


23 




SBQJ78 


SNP8 


br 9UU3 j o i T- VIC 


TCCTACAGCTtACTGAATA 


19 


66,4 


SEQJ79 


SNP9 




CTTATAATGTAGGGTGATGTGAGTGGAT 


28 




SEQ_80 


SNP9 ■ 


or? UU J i4grr am 


AAACCTAGATTeATTTATTT 


20 


— 


SEQ_81 


SNP9 


SP90U314K 


GrGAGACCCTGCCTCAAA 


IS 




SBQJ2 


SKP9 


SP9U0iI4iT'Ylc 


A A ACCTAGATTtATTTATTTATTT 


24 


65,8 


SEQ_83 


SNP10 


1 DA+r am 


rACATTTAATTCTaCACATTT 


21 


66,3 


SEQJ4 


SNP10 


SP9003I5r 


" TGtTCTATCAAAAAGTGACITTGAGATAGA 


30 






SNP10 




" ATGCACTGTGACTCGGGAGAC 


21 




SEQ_&6 


SHP10 




CACATTTAATTCTtCACATTT 


21 


66,8 


* SEQ_87 


SNP11 


Pt>n AA*>OQ A 4-Ti!JTri 


TCTAACAaCCACTTTACT 


18 


66 


SBQJ&8 


3NP11 


£!Tn A A AO A AX? 


rTGrCCAGAACAAGGAATGTC 


21 




■ $BQ_39 


SNP11 


Cft»AAA r >O0"D 


- AGTAGTCTTCATAGCAGCAATAAATCATG 


29 




SEQ_90 


3NPI1 


br y wzyy it* y jc> 


TCTAACAtCCACTTTACT 


18 


65,7 


SBQ_91 


SNP12 


C<T>AA A1 AA A -i-T?«ir« 


r A AGCCTCaGAAGTA 


15 


66,1 


SBQ92 


SNP12 


SP900300F 


"'" TCAACATTAATTTCATGGTACGTTCTC 


27 




SBQJ3 


SNP12 


SP9O03OOg+Vic 


CAAGCCTC^jAAGTA 


15 


65,7 


SECL94 


SNP12 


SP9003O0R 


" GAAACTACAGGAGTTACACTTCTCAGGTT 


29 




SBQJ5 


SNP13 


SP900301AH-Fam 


AAAGAATTTTCaAAACATC 


19 


66,3 


SEQ_96 


SWU 


S SP90030XF 


TCCATGGCTCCAGAGGCTC 


19 




SEQJ1 


|SNP1: 


i SP9003015L 


CAGGGCTTTCCTGATTAGTAATTAAAAATA 


30 
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SEQ-ID S3 


\r-xD x 


—J ma V AP T* fil"k &-TT1 


Sequence 


bp T 


m°C 


SECV 98 


NP13 S 


?90D3Oll+vic A 


ap t a a TTTTCtAAACATGC 


19 


66,3 


SEQ_99 . S 


NP14 S 


nrifvfvll^ A JVC?-*-! »-vi 

?9uU2/4A+ram U 


a rrTACCAaACAAT 


15 


66,7 



seclioo S 


HP 14 S 


-r-H r\ r\ j™i ■»> O - #1 

P900274r u 


TTf^AATATTAAGTGAGATAATGAATGA 


30 




SEQJ01 " S 


MP14 5 


P900274il ■ -ft 


oTrrArTCTTGCCTTTAAGGAAA 


24 




SEQJG2 3 


jNP14 S 


P900274 jl+vic i 


Tr A rCTACCAtACAATT 


18 


66,4 


SEQJ03 S 


$ 


F90Q33 /L/*rraia v. 


,r rrr A GGCTATcAAAGA 


17 


66 


SEQJ 04 * 


INP15 £ 


>P90033 /r * 


< ATG AATGAAAAGTGTTCACCTACCA 


26 




SEQJ05 £ 


nsfpis £ 


SP900337E. v 


n T a rr ATTTCATCTCAACCGC 


22 




SEQJ06 £ 


5NP15 i 


>P9G0337T+Vic 


rrTTT A GGCTATtAAAGA 


18 


66,1 


SECL107 5 


SHP16 S 


3P900316C+Fam i 


itta TrTTGcCTTAATG ATG A 


21 


66,6 


SBQJ08 \ 


3NP16 * 


3P900316F ' t 


^roa A A A AGCGGTTGAGAT - 


19 


— — ■ 


SECL109 


SNF16 ! 


3P90Q315R 1 


a r a rrrTGTTAAATCACCCAAAG 


24 




SBQJ10 


SNP16 


SP90031oli-Vic 


ATTArrTTGicrTAATGATGAAT 


23 


67 


SBQJ11 


SNP17 


SP900336F 


finTGATTTAACAGGATGTGAGTTTTAAA 


28 




SBQJI2 


SKP17 


SP90G33o(j+r am 


r A TTTTCA Ge AAATACA 


17 


66,5 


SEQJ13 


SNP17 


SP9Q033oft 


A AnArTTCATACCTGTTTCTGTTGTTTCT 


29 




SEQ_114 


SNP17 


SP90033dI j tVic 


nr ATTTTCAGtAAATACA 


18 


66,6 


SEQJ15 


SNP18 


SP9003 1 


TT A P A TTTcCTGGATCCT 


18 


66,1 


SBQJ16 


SNP18 


SP9003 17F 


a A ArTTrTTTCTGGATTGAGTTTTGAA 


26 




SEQJ17 


SNP18 


SP900317R 


rr A rrTACAAAAACTGAACCACAT 


24 




SBQJL18 


SHP18 


SP90031 /l-rvic 


T A r A TTTtTTGG ATCCTT 


18 


66 ? 4 


SEQJ19 


SNP19 


SP9O03 1 o A+x am 


C-TC T A P A AAsAGAATTC 


17' 


66,7 


SEQJ2Q 


3NP19 


SP90031 Br 


A rr A GTGATTAAGAAAGT ATTTCTTGTGA 


29 




SBQJ21 


SNP19 


SP9003i8^Vic 


tptap a A AtrAGAATTCA 


17 


66,5 


SBQJ22 


SNP19 


'SP900318B, 


r^GT A ArTCATAGTAAAAGTGGCTTGTT 


28 




SEQJ2S 


SHP20 


SP900275A-tram 


ATGTA Ar AftGTGCCTTC 


17 


66 ? 5 


SEQ^124 


SNP20 


SP900275F , ( 


nr A r AGTTATGATTTTATGTCAAGTGAA 


28 


- 


. SEQ^X25 


SNP20 


SP900275K 


ATTTTT AGTGCGTGATTTAGCATAGTG 


27 




SEQJ26 


S>l?20 


SP9002/^i+Vic 


~ ATGTAACAtGTGCCTTC 


17 


67 


SEQJ27 


SMP21 


SP90U33 5 am 


rrrTGTAAAAAATTaTTGTATCC 


23 


66 


SEQ_128 


SNP21 


SP90033^L,+ V1C 


" TfTGTAAAAAATTcTTGTATCC 


22 


66 


SEQJ.29 


SNP21 


SP900335F 


GGGATATGGATACAATTATTTACCCAAA 


28 




SEQJSO 


SNP21 


SP900335K 


' TGGTGTGGAAATCAGTGAACTTG 


23 




SBQJ31 


SNP22 


SP900334C+Fam 


TGACcGAAATTC 


12 


65,9 




SNP22 


SP900334F 


* SP900328F 


11 






SHP22 


t SP900334R 


- SP900328R 


.11 




SEQ_^2 


SKP22 


» SP900334T+Vic 


ACCAATCACtGAAATT 


16 
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SEQ-ID hi 


Nr-JJJ X J 




Sequence 


bp jT) 




Sj 


nP33 A 


C 


396305 10 






SEQJ67 S 


kTTi*1 A CI" 

MP34 S 




AGTTTCTAcGGAOCCT 


17 


66,6 


SECL168 S" 


NF34 S 


nAAft^ATT? T 


TrrCCACACTGAGGAAGGA 


20 




SBQ_169 S 


NP34 S 




fTACTTGCCTCCCCAACTT 


19 




3ECL170 S 


NP34 S 


■Artnno fi'!i , T\i-.\J r -irf*« f™ 
FyUU-iyj J.'TYIC > 


'PACrTTTCTAtGGAGCC 


17 , 


66,6 


SEQJL71 S 


NP35 S 


T»AAA2 I '}/Tr f* 


iCCTGTGTAGAGAAATGTAACAAATACC 


28 




SEQJ72 £ 


NP35 S 


T» A Afll *"f j4 rr J_T? «i*n J 


i AGTTTCAeTATCTCCTG 


18 


66,4 


SEQJ73 £ 


>NF35 5 


<TlAAA'2'7yn> f 


5GATGTTTAGTTGGATCATAAGAAAGAA . 


28 




SBQJ74 S 


JNP35 z 


sryllUi*^ l t y ic / 


^GAAGTTTCAtrATCTCCT 


20 


66,7 


SEQJ75 £ 


5NP36 £ 




^GTATCCTeAGACTCC 


16- 


67 


SEQJ76 £ 


SNP36 i 


IfiAAAO DAT? 1 


DTTCCGCCCCCTTCTAAGAG 


20 




SBQJ77 * 


3NF36 * 


ZVDAAfy} QAT? 


AAAGAACCTTTGGGAAGAAAATACAG 


26 




SEQJ.78 


"1VTTV3 j£ 

3HP36 




CAGTATCCTtAGACTCC 


17 


66 s 6 


3EQJL79 




a J: ?UU o l -ft. ~JT flxu. 


TCTTGTTTCAaGTAAAATA 


19 


66,5 


SEQJ80 


SKP37 


CDOA1Y5S21T? 


CCTGATGTCATTCTTCATAGTATTTTAACA 


30 




SBQJ8I 


SNP37 


oJryUvio IvJT V It* 


TCTTGTTTCAgGTAAAAT 


18 


66,1 ■ 


SECL182 


SNP37 




CCTTCTCAAGACAACGTATATTGCA 


25 




SEQJ83 


SNP38 


C»T>AAA^l'5.'> A4-T?iiTV» 


OOAACTTTTaTGTCGTTCT 


19 


65,9 


SEQJ84 


SNP38 




rAACITTTcTGTCGTTCT - 


18 


65 9 5 • 


SBQ_iS5 


SNP38 


C?T>0AA^7 / ?T? 


CATCimGTGTGATiTrATm'ATCTATGTCTC 


33 




SEQ_J86 


SNP38 




CCTGATGTCATTCTTCATAGTATTTTAACA 


30 




SECL.187 


SNP39 


b r V UUZ & j/V^f am 


TTCTAAATTTCAaAAAGAATGT 


22 


65,8 


3EQ_188 


SHP39 




GACCAGCTTGAACCCTACTGAAA 


23 




. SBQJL89 


SNP39 


CDGAAORTE? 
biryUUZcSjJtv 


AGGCGTGAGCCACTGCA 


17 




SEQJ.90 


SNP39 




ATTCTAAATTTCAtAAAGAATGT 


23 


65,8 


SBQJ91 


SNP40 


oir ? UU/S^t c-F axn 


" TCTCACTTTGTTGcCCACGCTGAT 


24 


65,8 


SEQJL92 


SNP40 




GGACCAACACAATTCTCTCCAGA 


23 




SEQJ.93 






TCTGACTTTGTTGgCCACGCTGAT 


24 


65,8 


SBQ__194 






GGAGGACTGCTTGAGGCCTC ' 


20 






SHP41 


AOL* 


C 12O91548_10 






SEQ_195 


SKF42 




TCCTcATGAAAAGC 


14 


66,5 


SEQ_196 


SNP42 


SP900326F 


CAAAGTCACTTTTTGATAGAACATTTCTC 


29 




SEQJ97 


3NP42 


SP900326g+Vic 


TCCTgATGAAAAGC . 


14 


66,5 


SEQ_198 


SHP42 


. SP900326R 


AAGTGGGTGAACGGCAAGAC 


20 




SEQ_199 


SNP4: 


» SP900327C^Fans 


CAACCGCTTTTCcGCAAAGAGG 


22 


65,7 


SEQ_200 


SHP4; 


J SP900327F 


TTCTGTTAATGTTTATCTGCTCATACCA 


28 




SBQ_201 


SNP4: 


3 SP900327R 


GCAAGAGTGGACTGAGGGTATTTT 


24 





WO 2005/066362 



-30- 



PCTVEP2005/000064 



SEQ-H> SI 


VP-ED [primer/Probe-ID 


Sequence 


rm T 


m °C 


SEQJ202 SI 




?900327T-Tet T< 


wAACCGCTTTTCtGCAAAGAGGAA 




OJ,o 


SEQJ203 S3 


NEP44 S 


P900276OFam T 


CCCCGGTCTGcAAACCTGC 


on 
A) 




SECL204 S* 


NP44 S 


P900276F T 


CACTGATTTCCACACCAACTACA - 






SECL205 S 


NP44 S 


P900276G-Tet C 


CCCGGTCTGgAAACCTGCA 


Oft 
ZU 




pQJOS S 


NP44 S 


P90Q276R T 


BTTCirTGAAACCCTATGAACCTG 


Zj 




SEQ_207 S 


S 


P900328A+Fam C 


TGGAACCAaAGTATT j 




OOjT- 


SBCL208 S 


fNP45 £ 


iP90032SC+Vic C 


:tggaaccacAGTAtt 


Id 




SBQ.209 £ 


;KP45 f 


2P900328F J 


CCAGAGCAGAATCTTTCTTACTCAGAAG ■ 


o & 




SBQJ10 I 


3NP45 £ 


5P900328R < 


3GGATATGGATACAATTATTTACCCAAA . 


Z& 


66 1 


SECL211 I 


3NP46 ! 


3P900329CM-Fam ' 


rACTGCAAcAGTACAATG 


1 5 
1& 




SEQJin 


3NF46 


SP90Q329F 


rCAACCTACCTGGGAAGATCAAA 


02 

2.D 




SBQJ&13 


SNP46 


SP900329R 


GGCCCTCTTTCCTTGACTATTCA 


07 


fif, A 


SEQJ14 


SKP46 


SP900329T+Vic 


ATACTGCAAtAGTACAATGA 


on 




SBQJL15 


SHP47 ' 


SP900279OFam 


CAACTAATGcTCCTCTAT 


"i Q 

iO 




SBCL216 


SHP47 


SP900279Fout 


^AAACAAAAAAAGAAAAATTACITACGAT 




, — - 


SEQ_217 


SNP47 


SP900279F 


TGCGATCACCTGGATTGATG 


Oft 




SEQ„218 


SNP47 


SP900279Rout 


TCTTAAAGATTTGATTTTTCTCCCATAAA 






. SEQJ19 


SHP47 
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OA 
Z4 


0 
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1 O 
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'fcCTCAAAAACATGTCAGTGTGATT 
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Table 3: Example of a TaqMan PGR Protocol 



TaqMan PGR Protocol 
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TPMT Assay 

Erythrocyte lysates were analyzed for TPMT activity by a HPLC method using 6-thioguanine as 
substrate described in Kroeplin, T. et'al., Eur. J Clin. Pharmacol (1998) 54: 265-271. 

■ Sequencing of "VNTR 

5 Sequencing of the VNTR of the TPMT gene was performed with an ABI Prism™ 3700 (Applied 
Biosystems) using a protocol as described by the manufacturer wilh the following primers: ' 

VNTR-Seql gctccgccctgcccattt (forward) and 

VNTR-Seq2 gtcattggtggcggaggc (reverse)' 

In general, molecular techniques were performed according to Sambrook et al. Molecular Cloning, 
10 A Laboratory Manual, 3 ri Ed. 2000; Cold Spring Harbor Laboratory Press. 

The VNTR regions that were amplified with the primers VNTR-Seql + 2 ranged in length from 
233 to 377 bp with 1-6 repeats of A (gtcattggtggcggaggc), 1-3 repeats of B (gaggcggggcgcgggcg), 
and 1 repeat of C (gaggcggggcgcggaga). 

Results 

15 From the ca. 1300 DNA. samples we identified 135 uniquehaplotypes in the TPMT gene. Table 4a 
' shows me allele frequencies of all polymorphic SNPs; 20 SNPs were found to be monomorphic in 
our 1300 DNA samples (listed in Table 4b). Surprisingly, 5 out of 9 reference SNPs were 
monomorphic in the tested population. Although 5 out of 6 SNPs were taken from one patent 
application as reference SNPs and were meant to be used as benchmark SNPs! 

20 Table" 4a: Allele frequencies of all 30 polymorphic SNPs in ca.1300 samples. Reference SNPs 
are marked with a "R" in the- second column, linked SNPs are shaded, other SNPs, which are 
mentioned particularly in the text are marked either with a comma (,) or a dash (-). 
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Table 4b: 20 monomorphic SNPs were found in the 1300 DNA samples tested. 



snp-ed i 


iay-SNP-ED 


* 


SNPS • 


900297 i 


nonomorphic 


SNP6- 


900273 i 


nonomorphic 


SNP11 


900299 i 


nonomorphic 


SNP13 


900301 l 


cnonomOTphic 


SNP14 


900274 


monomorphic 


SNP15 


900337 


monomorphic 


SNP 19 


900318 


monomorphic 


SNP21 


900335 


monomorphic 
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monomorphic 
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900292 


monomorphic 
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900324 


monomorphic 
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900281 
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SNP38 
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monomorphic 


■ SNP39 


900283 


monomorphic 


SNP40 


900284 


monomorphic 


Reference SNPs: 


SNP42 


900326 


monomorphic 


SNP43 


900327 


monomorphic 


SNP46 


■ 900329 


monomoiphic 


SNP48 


900330 


monomorphic 


SNP49 


900331 


monomorphic 



10 



Table 5 shows all different hapkrtypes of 30 polymorphic SNPs in 5' to 3 s direction on the TPMT gene 
(from left to right in fee table). Positions of SNPs are mentioned in reference to the accession number 
AL589723 (reverse complement). To get a better overview, the wild type genotype is symbolized in 
Table 5 with a comma Q, the heterocygote is marked with an (o) and tie mutant homocygote is marked 
with an (X).' The real genotypes can be read from the bottom of the table. It can be seen from toe table 
that between SNP 47 and SNP 27 starts a transition of one haplotype block to another one, representing 
probably a crossover point of maternal and paternal chromosomes in meiosis. The downslream part of 
toe TPMT gene, which starts in Table 5 with SNP 27, codes for the last four exons of the TPMT protein. 
This haplotype block contains SNPs mat have in nearly all patients measured a similar allele frequency 
wito very similar occurrences of wild type, heterocygote and mutant genotypes. 
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One exception in this haplotype block is SNP 50, which shows an independent pattern. The " 
upslxeam part of fee gene contains SNPs that show from patient to patient a more independent 
pattern of allele frequencies (probably a hot spot of recombination). Due to these two adjacent 
haplotype blocks wifein one gene it is a priori not possible to conclude linked SNPs merely from 
5 the fact that they are neighbors on a gene. But surprisingly we found that S» 10, 17, 47 and 50 
are linked to each other, more precisely SNP 10 is highly linked to SNP 50 and SNP 17 is highly 
linked to SNP 47. Even more surprisingly we found that SNP 26 and 29 represent one haplotype 
mat is linked to the reference SNPs 47 and 50 and to fee SNPs 10 and 17 in fee following way: 

10 When SNP 26 being HT and SNP 29 being WT fee TPMT enzyme is deficient. 

When SNP 26 being MT and SNP 29 being WT fee TPMT enzyme is more deficient. 
When SNP 26 being MT and SNP 29 being HT the TPMT enzyme is deficient. 

In a similar way one can identify from Table 5 other haplotypes feat are linked to deficient TPMT 
15 enzyme activity: 

When SNP 7 being MT and SNP 20 being HT fee TPMT .enzyme is deficient. ' 

When SNP 7 being WT and SNP 8 being HT and.SNP 20 being WT fee TPMT enzyme is 

deficient. 

20 ' * 

In Table 5 other haplotypes can be identified to describe TPMT deficient individuals. For example 
any of fee haplotypes from row 1 to 57 in Table 5 can be used to describe individuals who are 
TPMT enzyme deficient using two or up to all of fee following SNPs: SNP1, 2, 3, 4, 7, 8, 9, 10, 
12, 16, 17, 18 f 20 ) 22,-23,25,26,27,28 ) 29 > 3l,32 } 33 > 34 J 36,41.InTable6isanexampleof 10 

25 individuals wife their respective haplotypes from Table 5: . 



WO 2005/066362 .43. 
Table 6: Examples of haplotypes of TPMT deficieat individuals 
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Each SNP in one row has to be combined wilh another one from the same row, whereas 
5 combinations can be two, toe, four or up to all SNPs. In most cases it will be sufficient to take 
one or two of the SNPs 27, 28, 29, 31, 32, 33, 34, 36, 41 because they are very tightly linked to- 
each other. (See complete Table 5). 

A further'example is given in Table 7 that shows the correlation of TPMT enzyme activity 
' measured* healthy volunteers together with meir individual haplotype of 10 SNPs. Erythrocyte 
10 lysates were analyzed for TPMT activity by a HPLC method using 6-mioguanine as substrate. The 
method is described in Kxoeplin, T. et al., Eur. J Clh, Pharmacol (1998) 54: 265-271. The enzyme 
activity was measured hi nmoVgHb/h. The TPMT activity showed a range from 0 nmol/gHb/h to 
106 nmol/gHb/h wi& a median of 46.6 nmoygHb/h and a mean of 47.6. When setting the cutoff to 
34.5 mnoVgHb/h the here presented haplotypes of patients whose TPMT value is below this cutoff 
15 have a sensitivity and specificity of 93 % respectively. With this example, the responding 
haplotypes are further examples of haplotypes that constitute the different TPMT phenotypes in 



~ PCT/EP2005/000064 
WO 2005/066362 . 44 _ 

ta^andcanbeusedas an aid fortberapy decision wbax respective patient ha,e to be treated 
with ihiopurines or derivatives. 

Table?: Correlation of SNPs and Haplotypes to TPMT Enzyme Activity 
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Table 8 gives a detailed summary o; 
special interests are 



are correlated to TPMT enzyme activity, Of 
low or medium TPMT enzyme activity. 



Table 8 : Detailed 
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5 Table 8, continuation: The corresponding genotype of each ShiP being WT, HT or MT can be read 
from the following table: 
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Legend of Table 8: SNP genotypes: WT - wildtype; HT = heterozygote; MT - mutant. TPMT 
enzyme activity: 0 = absent or low; 1 - medium; 2 = nonnal or Mgb 

A descriptive summary of the best haplotypes that are correlated to absent, low or medium TPMT ■ 
5 activity are given below: 

Haplotype group 1: 

men SNP 26 being MUTANT and SNP 29 being WILDTYPE the TPMT enzyme activity is 
absent or low. 

When SNP 26 being HETEROZYGOTE and SNP 29 being WILDTYPE the TPMT enzyme . 
10, activity is medium. 

' ' When SNP 26 being MUTANT and SNP 29 being HETEROZYGOTE the TPMT enzyme activity 
is medium, 

Haplotype group 2: 

When SNP 20 being MUTANT and SNP 7 'being MUTANT and SNP 8 being WILDTYPE and 
15 SNP 26 being WILDTYPE and SNP 29 being WILDTYPE the TPMT enzyme activity is absent or 
low. - 

When SNP 20 being HETEROZYGOTE and SNP 7 being MUTANT and SNP 8 being 
WILDTYPE the TPMT enzyme activity is medium. 

When SNP 20 being WILDTYPE and SNP 7 being WILDTYPE and SNP 8 being 
20 HETEROZYGOTE or MUTANT the TPMT eicyme activity is medium. 

Haplotype group 3: 

When SNP 10 being MUTANT and SNP 17 being MUTANT fee TPMT. enzyme activity is absent 
or low. 
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When SNP 10 being HETEROZYGOTE or MUTANT and SNP 17 being WILDTYPE the TPMT . 
enzyme activity is medium. 

When SNP 10 being MUTANT and SNP 17 being HETEROZYGOTE the TPMT enzyme activity 
is medium. 

When SNP 10 being HETEROZYGOTE or WILDTYPE and SNP 17 being HETEROZYGOTE 
the TPMT enzyme activity is medium. 

When SNP 10 being WILDTYPE and SNP 17 being MUTANT the TPMT enzyme activity is 
medium. 

As a further embodiment of mis invention one can combine the predictive power of the Here 
described genotype and haplotype correlations to me TPMT expression with me number of 
VNTRs in me respective patients. Whereas a higher number of repeats responds inversely to the 
TPMT activity.- 
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Claims 

1. An isolated polynucleotide molecule comprising a mutant allele of toopurine S-methyl- 
fefflsferase (TPMT) gene or fragments thereof containing single nucleotide polymorphisms 
(SNPs 1 - 41) as shown in table 1 . 

5 2. An isolated' polynucleotide molecule comprising a mutant allele of tbiopurine S- 
methyltransferase (TPMT) gene or a fragment thereof confining at least two or more of 
single nucleotide polymorphisms (SNPs 1- 41) as shown in Table 1 . 

3. An isolated polynucleotide molecule comprising a mutant allele of foiopurine S- 
me&yln-ansferase (TPMT) gene or fragments thereof containing single nucleotide 
10' polymorphisms, SNPs 10 and/or 17, and/or 26 and 29 in the following haplotypes 

(combinations): 

■ a) SNP 26 being MT (GG) and SNP 29 being WT (GG) 

b) SNP 26 being HT (AG) and SNP 29 being WT (GG) 

c) SNP 26 being MT (GG) and SNP 29 being HT (AG) 
15 ' d) SNP 10 being MT (TT) and'SNP 1 7 being MT (GG) 

e) SNP10bemgHT(AT)orMT(Tl)andSNP17bemgWT(TT) 
•f) SNP 10 being MT (TT) and SNP 17 being HT(GT) 

g) SNP 10 being HT (AT) or WT (AA) and SNP 17 being HT (GT) 

h) SNP 10 being WT (AA) and SNP 17 being MT (GG). ' • 

20 4. An isolated polynucleotide molecule comprising a mutant allele of fiiiopurine S-methyl- 
transferase (TPMT) gene or fragments thereof containing single nucleotide polymor- 
phisms, SNPs 7, 8, 20 and/or 26 and 27 in the following haplotypes (combinations): 

a) SNP 7 being MT (AA) and SNP 8 being WT (TT) and SNP 20 being MT (AA) 
and SNP 26 being WT (AA) and SNP 29 being WT (AA) 

25 b) SNP 7 being MT (AA) and SNP 8 being WT (TT) and SNP 20 being HT (AT) 

c) SNP 7 being WT (TT) and SNP 8 being HT (AT) or MT (AA) and SNP 20 being 
WT (TT). 
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5. An isolated polynucleotide molecule folly complementary to any one of the polynucleotide 
molecules of claims 1-4. 

6. A diagnostic assay or kit for' detennining tMopisrine S~methyl-transferase (TPMT) 
. genotype of a subject which, comprises 

5 a) isolating nucleic acid from said subject; 

b) amplifying specifically a tbiopurine S-methyltxansferase (TPMT) PCR fragment 
with primers of Table 2 from said nucleic acid, which includes at least one of 
SNPs of claims 1 - 4 thereby obtaining an amplified fragment; and 

c) genotyping the amplified fragment obtained in step b), thereby determining the 
10 tbiopurine S-methyltxansferase (TPMT) genotype or haplotype of said subject, 

d) the kit comprising sequence determination primers and sequence determination 
reagents, wherein said primers are selected from the group comprising primers that 

• hybridize to polymorphic positions in the human TPMT genes according to claims 
" 1 - 4; acid primers that hybridize immediately adjacent to polymorphic positions in 
15 the human TPMT gene according to claims 1-4, 

7, A kit as defined in claim 6 detecting a combination of two or more, up to all, polymorphic 
sites selected from the groups of sequences as defined in claim 1-4. *■ 



20 



8. 



A method for -determining a patient's individual.response to thiopurine therapy, including 
drug efficacy and adverse drug reactions, comprising . determining the identity of 
nucleotide variations according to claims 1-4. 
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<400> 60 1' 
tctegatgtt attgaac 

<210> 5X 

■ <211> 29 

<212> DNA 

<213> Homo Sapiens 



<400> 61 . 

cagaggaaaa tattcaatta agggttaag 



<210> 62 

<2ll> 20 

<212> BNA 

<213> Homo. Sapiens 



<400> €2 

ttfctcfccgat tfctattgaac 



<210> 63 

<211> 19 
<212> 

<213> Homo Sapiens 



<400> 63 

agcccgcaca gccttgcag 



WO 200S/066362 . 23 

<210> 64 

<211> 17 

<212> BHA 

<213> Homo Sapiens 

- <400> 64 
tcttcccggc cgafcagg 

<210> 65 
<211> 17 
<212> 

<213> Homo Sapiens 
<400> 65 

cocgcacggc ctfcgcag 

<210> .66 
. <211> 25 
<212> 

<213> 'Homo Sapiens 
<400> 66 

gctgtgccag agaattacta caaca 

<210> 67 

. <2U> 21 

<212> DUA 

<213> Homo Sapiens 

<400> 67 

tagcccggga attccccctt o 

<210> 68 

<2U> 17 

<212> DKA 

<213> Homo Sapiens 



PCT/EP20O5/OOOO64 



17 ' 



17 



25 



WO 2005/066362 



-24- 



PCT/EP2005/000064 



<400> 68 17 
ggcaacatcg cgacgaa 

<210> $9 
<211> 20 
<2X2> 

<2£3> Homo Sapiens 
<400> 69 

atacctcctg cccaggatta 

<210> 70 

<211> 23 

«212> DNA 

<213> Homo Sapiens 

<4Q0> 70 

tagcccggga afcttcccctt cfcfc 

<210> 71 , 

<2X1> 17 

<212> DHA 

<213> Homo Sapiens 

<400> 71 

atttcacaaa aatttct 

<210> 72 

<211> 25- 

<212> BNA 

<213> Homo Sapiens 

<400> 72 

gcacattaca agaattaagg aaggg 



<210> 73 



WO 2005/066362 ..-25- PCT/EP2005/000064 

<211> 21 
<212> DNA 
- <2X3> Homo Sapiens- 



<400> 73 

tfcgaggactt tgtttgtggg c 



<210> 74 , 

<211> 19 

<2X2> DWA 

<213> Homo Sapiens 



<4Q0> 74 

aaatfctcaca ataattfcct 



<210> 75 

<211> 19 

<212> DSTA 

<213> Homo Sapiens 



'<400> 75 

tcctacagct aactgaata 



<210> 76 

<211> 25 

<212> DNA 

<213> Homo Sapiens 



<400> 76 

cafcgggbaet ttecfcecttt cataa 



<2X0> 77 

<211> 23 

<212> nm 

<213> Homo Sapiens 



WO 200S/066362 . 2 6- PCT/EP2005/000064 

<400> 77 23 
fcgaggaaggt ggccaaatat aca 

<2l0> 78 
<2X1> 19 

<212> nsa 

<213> Hojbo Sapiens 



<400> 78 19 
tcctacagct tactgaata 



<210> 79 

<211> 28 

<212> X>NA 

<213>' Homo Sapiens 



<4D0> 79 ' , 28 

Sttataatgt agggtgatgt gagtggat 

<210> 80 

<211> 20 ' 

<212> DHA 

<213> Homo Sapiens 



<400> 80 * 20 

aaacctagat tgatttattt 

r 

<210> 81 

<211> IS 

<212> SHA r 

<213> Homo Sapiens 



<400> 81 18 
gcgagaccct gcctcaaa 

<210> 82 
<211> 24 



WO 2005/066362 w . FCT/EP2005/000064 

<212> DNA 

<213> Homo Sapiens 
<400> 82 

aaacctagat ttatttattt attt 
<2£0> S3 

<212> DNA 

■<213> Homo Sapiens 

<400> 83 

cacafcttaat tcfcacacatt t 

<2X0> 84 

<21X> 3 0 

<2X2> BHA 

<213> Homo Sapiens 

<400> 84 

fcgttctatca aaaagtgact ttgagataga 

<210> 85 
<211>, 21 
<2X2> DKA 
. <213> Homo Sapiens 

<400> 8S 
- atgcactgtg actcgggaga c 

<210> 86 

<211> 21 

<212> t>KA 

<213> Homo Sapiens 



<400> 86 

cacatttaat tcttcacatt t 
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WO 2005/066362 „ 2 8- 



<21Q> 87 

<213-> IS " 
<212> 

<213> Homo Sapiens 



<400> 87 

tctaacaacc actttact 



<2X0> SS 

<2ll> 21 

<212> BNA 

<213> Homo Sapiens 



<400> es 

ctgcccagaa caaggaatgt c 



<210> 89 

<21X> 29 

<212> ttfSTA 

<213> Homo Sapiens 



<400> 89 

agtagtctte atagcagcaa. taaatcatg 

<210> 90 

<211> 18 

' <212> DNA 

<213> Homo Sapiens 

<400> 90 

tctaacatcc acttfcacfc 



<210> 91 
<211> 15 
<212> DNA 



WO 2005/066362 _ 29 . . PCT/EP2005/000064 

<213> Homo Sapiens 



<400> 3X 
caagcctcag aagta 



<210> $2 

<2XX> 27 

<2X2> Vffih 

<213> Homo Sapiens 



<400> 92 

tcaacattaa fcttcafcggta cgttctc 



<210> 93 

<2XX> X5 

<212> OTA 

<2X3> Homo Sapiens 



<400> S3 
caagcctcgg aagta 



<210> 9A 

<2X1> 29 

<2X2> OTA 

<2X3> Homo Sapiens 



<400> 94 

gaaactacag gagttacact fcctcaggtt 



<2X0> 55 

<21X> 19 

<2X2> DHA 

<2X3>. Homo Sapiens 



<400> 95 

aaagaatttt caaaacatc 



WO 2005/066362 _3Q_ 

<210> 96 

<211> 19 

<212> SNA 

<213> Homo Sapiens 



<400> 95 

tccatggctc cagaggcfcc 



<2X0> 97 

<211> 30 

<212> BNA 

<213> Homo Sapiens 



<400> 97 

cagggctttc ctgattagta attaaaaata 



<210> 98 

<2X1> 19 

<212> X>HA 

<213> Homo Sapiens 



<400> 98 

aagaattttc taaacatcc 



<210> 9 9 

<211> 15 

<212> D23A 

<212> Homo Sapiens 



<400> 99 
caoctaecaa acaat 



c210> 100 

<211> 3 0 

<212> DNA 

<213> Homo Sapiens 



WO 2005/066362 
<400> 100 

gttgggaata ttaagtgaga taatgaatga 

<210> 101 

<213L> 24 

<212> .DNA 

<213> Homo Sapiens 

<400> 101 

agtccaetct tgccttfcaag gaaa 

<210> 102 

<211> . IS 

<212> MA 

■ <2X3> Homo Sapiens 

. <400> 102 

ttcacctacc atacaatt 

<210> 103 

<211> 17 

<212> DHA 

<213> Homo Sapiens 

<400> 103 
ct-fccaggcta tcaaaga 

<210> 104 

<211> 2$ 

<212> DHA 

<213> Homo Sapiens 

<400> 104 

aatgaatgaa aagtgttcac cfcacca 
<210> 105 



PCT/EP2005/000064 



30 



2'4 



IS 



17 



26 
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<2X1> 22 + 

<212> BKA 

<2XZ> Homo Sapiens 

<400> 105 

cataccattt catctcaacc gc 

<210> 106 

<2I1> 18 

<212> DHA 

<213> Homo Sapiens 

<400> 10G 
tcttcaggct attaaaga 

<21D> 107 

<2X1> ^21 

<212> BNA 

<213> Homo Sapiens 

<4Q0> 107 

attatcttgc cttaatgatg a 

<2io> ioa 

<2!1> 19 

<212> DWA 

<213> Homo Sapiens 

<400> 108 

gcggaaaagc ggtfcgagat 

<210> 109 

<211> 24 

<212> DNA 

<213> Homo Sapiens 



WO 2005/066362 - 33 

<4O0> 109 

cacatcctgt taaatcaccc aaag 

<210> 110 

<211> 23 

<212> BNA 

<213> Homo Sapiens 

<400> 110 

afcfcatctfcgt cttaatgatg aat 

<210> HI 

<211> 28 

<212> DBFA 

<213> Homo Sapiens 

<400> HI 

ggtgatttaa caggafcgfcga gttttaaa 

<210>" 112 

<2X1* 17 
<212> 

<213> Homo Sapiens 

<400> 112 
cattfctcagg aaataca 

<210> 113 

<211> 29 

<212> DMA. 

<213> Homo Sapiens 

<40Q> 113 

aagacttcat aoctgtttct gttgtttct 
<210> H4 



PCT/EP2005/000064 

24 



<211> IS 



WO 2005/066362 « 34 . 

<212> BKA 

<213> Homo Sapiens 



<400> 114 
ceattttcag taaataca 



<210> 115 

<21X> 18 

<212> DKA 

<213> Homo Sapiens 



<400> 115 
tfcacatttce tggatccfc 

<210> 116 

<211> 25 

<212> mh 

<213> Homo Sapiens 



<400> 116 

'gaagfcctttc tggattgagfc fcttgaa 

<210> 117 

<211> 24 

<212> BHA 

<213> Homo Sapiens 

<400> 117 

ccacctacaa aaactgaacc acat 

<210> US 

<211> 18 

<212> DNA 

<213> Homo Sapiens 



<400> US 
tacattttcfc ggatcctt 



WO 2005/066362 . 35 - PCT/EP2005/00D064 

<2X0> 1X3 

<211> 17 

<212> M8A 

<213> Homo Sapiens 



<400> 

ctctacaaaa agaattc 



<210> 120 

<211> 29 . 
<212> 

<213> Homo Sapiens 



'<400> 120 

accagtgatt aagaaagtat ttcttgtga 



<2X0> 121 
<2ll> 17 
<212> 

<213> * Homo Sapiens 



<400> 121 
tctacaaaga gaattca 



<210> 122 

<2X1> 28 

<212> OTA 

<213> Homo Sapiens 



<400> 122 

gggtaactca tagtaaaagt ggettgtt 



<210> 123 
<211> 17 
<212> DKA 



WO 20Q5/066362 
<213> Homo Sapiens 

<400> 123 
atgtaacaag tgccttc 

<210> 124 

<211> 28 

<212> m& 

<2X3> Homo Sapiens 

<400> 124 
gcaaagttafe gattttatgt caagtgaa 

<210> 125 

<211> 27 

<212> DNA 

<213> Homo Sapiens 

<40o> 125 : 

atttttagtg cgfegatttag eatagtg 

<210> 126 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 126 
atgtaacatg tgccttc 

<210> 127 

<211> 23 

<212> Tffi& 

<213> Homo Sapiens 



PCT/EP2005/000064 

.36- 



<4Q0> 127 . 

ctctgtaaaa aattattgta tec 



WO 2005/066362 

<210> 128 
<211> 22 
<212> 

<213> Homo Sap leas 
<400> 128 

tctgtaaaaa afcfccttgtafc cc 

<210> 125 

<211> 28 

<212> OTA 

<213> Homo Sapiens 

<400> 123 

gggatatgga tacaattatt tacccaaa 

<21t» 13 0 

<211> 23 

<212> EHA 

<213> Homo Sapiens 

<400> 130 

tggtgbggaa atcagtgaac ttg 

<210> 131 

<21X> 12 

<212> PNA 

<213> Homo Sapiens 

<400> 131 
tcaccgaaat tc 

<210> 132 

<2X1> 16 
<212> 

<213> Homo -Sapiens 



PCT/EP2005/000064 



22 



28 



23 



WO 2005/066362 

<40Q> 132 
aceaafccact gaaatt 

<210> 133 

<21X> 17 

<212>. DNA 

<213> Homo Sapiens 

<400> 133 
cctggcataa gfcactgfc 

<210> 134 

<21X> 16 

<212> BETA 

<2X3> Homo Sapiens 

<400> 134 
cbggeatcag tactgt 

<210> 135 

■ <2X1> 22 

<2I2> DNA 

<213> Homo Sapiens 

<400> 135 

ccccaggoca afctatatcag aa 

<2X0> 136 

<211> 21 

<212> DBA' 

<213> Homo Sapiens 

<400> 136 

aactttgcct gcagatfcgga a 
<210> 137 



PCT/EP2005/000064 



WO 2005/066362 „ 39 . PCT/EP2005/000064 

<21X> 21 

<212> ENA 

<213> Homo Sapiens 

<400> 137 

agtaaaaafca tcactctget c 

<210> 138 

<21X> 27 

<212> ota 

<213> Homo Sapiens 

<400> 138 ' 

gataattggt tgacc'tgcag atfctatc 

<210> 139 

<211> 20 

<212>. PN& 

<213> 'Homo Sapiens 

<400> 139 

agagfcaaaaa 'tgfccaefcctg 

<210> 140 
<211> 29 
<212> TMK 

► 

<213> Homo Sapiens 

<400> 140 
■ gcttgctata aaattcfcaac aatgtttcc 

<210> 141 

<211> IS 

<212> ESTA 

<213> Homo Sapiens 



WO 2005/066362 , 40 . PCT/EP2OO5/O0O064 

<400> 141 
afccttcaaag ttgtccte 

<210> 142 . 

<211> 25 

<212> TMK 

<213> Homo Sapiens 

<400> 142 

ctctgaagtg agtaacagcc aactg 

<210> 143 

<211> 16 

<212> BNA 

<213>. Homo Sapiens 

<400> 143 
cttcaaggtt gtcctc 

<210>' 144 

<211> 26 

<212> BKA ' . , 

<213> Homo- Sapiens 

<400> 144 

gcactttatt ggcacctfcat tttttfc 

<210> 145 

<211> 16 
<212> 

<213> Homo Sapiens 

<4D0> 145 
ttagttgcca toaatc 

<210> 146 
<2X1> is 



WO 2005/066362 

<212> BH& 

<213> 'Homo Sapiens 

<400> 146 
ttagttgcca ttaatcca 

<210> U7 
<211> 23 

<2i2> , m& 

<213> ' Hoxao Sapiens 

<400> 147 
• eacaafccatc accacctcca eta 

<210> 14S 

<211> 27 

<212> 3>KA 

<213> Homo Sapiens 

<400> 148 

tcacttgcct afcgccaggta fctgttca 

<210> 149 

. <2X1> 23 

<212> m& 

<213> Homo Sapiens 

<400> 149 

cccagcccac ataaagtatt ttg 

<210> 150 

<211> 29 

1 <212> DW& 

<213> Hotoo Sapiens 

<400> 150 

ctggtcactt gectattcca ggtattgtt 



PCT/EP2005/OOOD64 
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<210> 151 

<211> 24 

<2i2> ona. 

<213> Homo Sapiens 



<4at>> 151 

ctaagtaatt caatatttcc atgc 



<210> 152 

<211> 25 

<212> DNA 

<213> Homo Sapiens' 



<4D0> 152 

caagtgatga gtctgctcca tacaa 



<210> 153 

<211> 22 

<212> DHA 

<213> Homo Sapiens 



<400> 153 

ctaagtaatt cagtatttcc at 



<210> 154 

<211> 3 0 

<212> DNA 

<213> Homo Sapiens 



<400> 154 

tgaccacatc tgtatactct ttcaattaaa 



<210> 155 
<211> IS 
<212> DNA 



WO 2005/066362 
<213> Homo Sapiens 



PCT/EP2005/000064 



<400> 155 
taggctccac atcag 

<210> 156 

<2X1> 16 

<212> DBA 

< 2 13 > Homo ■ Sap lens 

<400> 156 
ttaggctccc catcag 

<21Q> 157 

<21X> 20 • 

<212> UNA 

<213> Homo Sapiens 



<400> 157 

gggcaacgga gtgagattbc 

<210> 158 

<211> 26 

<212> DHA 

<213> Homo Sapiens 

<400> 158 

attaggtttg gcagfcaagcc tfcactg 

<210> 159 

<211> 15 

<212> BNA 

<213> Homo Sapiens 



<400> 159 
cgaaactccg tcfccg 
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<210> 160 

c21I> IS 

<212> DNA 

<213> Homo Sapiens 

<400> 160 
ccagccfcggg caacaaga 

<210> 161 

<211> 25 

<212> DNA 

<213> Homo Sapiens 

<400> 161 

gccaafcattfc gtcctaccag aaaga 

, <210> 162 
<211> 17 
<212> DNA 

<213> Homo Sapiens 

•<40O> 162 . 
cgaaacfcctg tctcgaa 

<21D> 163 

<212> DKA 

<2l3s> Homo Sapiens 

<4G0> 163 

agccttatgt tagtatttt 

<210> 164 

<211> 23 

<212> DHA 

<213> Homo Sapiens 



WO 2005/066362 , 45 „ PCT/EP2005/000064 



<400> 164 

ccaaagtgct gggattacag atg 



<210> 165 
<211> 13 

<213> Homo Sapiens 



<400> 165 

cccagcctta tcttagtafc 



<210> 166 

<2U> 28 

<212> DHA 

<213> Homo Sapiens 



<400> 166 

gtgctaacafc ggtaagtact gagtacca 



<21D> 167 

<211> 17 

<2X2i> BN& 

<213> Homo Sapiens 



<400> 107 
cagtttctac ggagcct 



<210> 168 

<211> 20 

<212> DMA 

<213> Homo Sapiens 



<400> 168 

ttccccacac tgaggaagga 



<210> 169 



WO 2005/0*6362 . 46 . PCT/EP2005/000064 

<2U> 19 
<212> / 
<213> Homo Sapiens 

<400> 169 

gbactfcgcct ccccaactt 

<21G> 170 

<211> .17 

<212> DH& 

<213> Homo Sapiens 

<400> • 170 
ccagtttcta tggagcc 

• <210> 171 

<211> 2S 

c212> DSTA 

<213> Hosao Sapiens 

<400> 171 
> gcctgfcgtag agaaacfcgtaa caaatacc 

<21G> 172 ■ 

■<2±X> 18 

<212> DNA 

<213> Homo Sapiens 

<400> 172 
aagtttcagt atctcctg 

<210> 173 

<211> 23 

<212> DNA 

<213> Homo Sapiens 



WO 2005/066362 
<4Q0> 173 

ggatgtttag ttggatcata agaaagaa 

<210> 174 

<211> 20 

<212> DKA 

<213> Homo Sapiens 

<400> 174 

aagaagtttc afcfcatctcct 

<210> 175 

<211> 16 

<212> DETA 

<213> Homo Sapiens 

<400> 175 
agtatcctca gactcc 

<210> 176 

<211> 20 

<212> DHA 

<213> Homo Sapiens 

. <400> 176 
cttccgcccc cttctaagag 

<210> 177 

<211> 26 

<212> DSTA 

<213> Homo Sapiens 

<400> 177 

aaagaacctt tgggaagaaa atacag 
<210> 178 



PCT/EP2005/000964 

2a 



20 
16 
20 



<211> 17 



WO 2005/066362 

<212> DKA 

<213> Homo Sapiens 

<40Q> 17S 
cagtatcctt agactce 

( <210> 179 

<211> 19 

<212> DNA 

<213> Homo Sapiens 

<400> 179 
tcttgtttoa agtaaaata 

<210> 180 

<211> .30 

<212> DKA 

<213> Homo Sapiens 

<40Q> ISO 

'cctgatgtea fcfccfctcatag tattttaaca 

<210> 1S1 
<211> 18 

<2i2> m 

<;400> 181 
tcttgtttca ggfcaaaat 

<210> 182 

<211> 25 

<212> DNA 

<213> Homo Sapiens 



PCT/EF2005/000064 



17 



■19 



30« 



<4G0> 182 

ccttctcaag acaacgtata ttgca 



25 



WO 2005/066362 
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<21Q> 183 

<211> 19 

<212> m& 

<213> Homo Sapiens 

<400> 183 

ccaactttta tgfccgttct 15 

<210> 184 

<211> 18 

<212> DKA 

<213> Homo Sapiens 



<40Q> 184 

caacttttct gtcgttct 18 



<210> 185 

<aii> 33 

<212> DNA 

<213> Homo' Sapiens 



<400> 185 

cafcgfccagfcg tgafcfcfctatfc ttatcfcatgt etc 33 



<210> 186 

<211> 30 

<212> MA 

<213> Homo Sapiens 



<400> 186 

cctgafcgtca ttcttcatag tattfctaaca 30 

<210> 187 
<2U> 22 
<212> DMA 



WO 2005/066362 

<2X3> Homo Sapiens 

<40f» 187 

ttctaaafctt caaaaagaat gt 

<210> 188 

<2ll> 23 

<212> DMA 

<213> Homo Sapiens 

<400> 188 

gaccaccttg aaccctactg aaa 

<210> 189 
<2X1> 17 

<2i2> ms& 

<213> Homo Sapiens 

<400> 189 
aggcgtgagc caetgca 

<210> 19.0 

<211> 23 

<212> DNA 

<213> Homo Sapiens 

<400> WO 

attctaaatt tcataaagaa tgfc 

<210> 151 

<211> 24 

<212> DttA 

<213> Homo Sapiens 



PCTYEP2005/090064 



22 



23 



17 

fw 



<400> 191 

tcicactttg ttgcccacgc tgat 



24 



WO 2005/066362 

<210> 122 

<21X> 23 

<212> DXTA 

<213> Homo Sapiens 

<400> 192 

ggaccaacac aatfcctctcc aga 

<210> X&3 

<211> 24 

<212>. OTA 

<213> Homo Sapiens 

<400> 193 

tctcacfettg ttggccacgc tgat 

<210> - 194 
<211> 20 
<212> 2>HA 

<213> Homo Sapiens . 
<400> 194 

ggaggactgc ttgaggcctc 

<210> 195 

<211> 14 

<2X2> BETA 

<213> Homo Sapiens 

<400> 195 
tcctcatgaa aagc 

<210> 196 

<211> 29 . 

<212> PKA 

<213> Homo Sapiens 



PCT/EP2005/000064 
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<40Q> 196 

caaagtcact ttttgataga acattfcctc 



<210> 197 

<211> 14 

<212> £NA 

<213> Homo Sapiens 



<400> 197 
tccfcgatgaa aagc 

<210> 198 

<21X> 2 0 

<212> OTA 

<213> Homo Sapiens 

<400> X$8. 

aagtgggtga acggcaagac 

<210> 199 

<211> 22 

<2X2> DtfA 

<213> Homo Sapiens 

<400> 159 

caaccgcttt tccgcaaaga gg 

<2XG> 200 

<2X1> 28 

<212> DHA 

<213> Homo Sapiens 

<400> 200 

ttctgttaat gtttatctgc tcatacca 
<210> 201 



WO 2005/066362 

<211> 24 
<212> DKf£ 

<21Z> Homo Sapiens 

. - v 

<:400> 201 

gcaagagtgg actgagggta tttt 

<210> 202 
<2L1> 25 

<2i2> rm& 

<213> Homo Sapiens* 
<400> 202 

tcaaccgcfct ttctgcaaag aggaa 

<210> 203 

<211> 20 

<212> DKA 

<213> Homo Sapiens 

<400> 203 

tccccggtct -gcaaacctgc 

<210> 204 

. <211> 24 

<212> m& 

<213> Homo Sapiens 

<400> 204 

tcactgattt ccacaceaac taca 

<210> 205 

<211> 20 

<212> DHA. 

<2X3> Homo Sapiens 
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<400> 205 

ccccggtctg gaaacctgca 

<210> 206 
<211> 25 

<2i2> mm 

<213> Homo Sapiens 
<40Q> 206 

tgttctttga aaccctatga acctg 

<210> 207 

<21X> 16 

<212> DMA 

<213> Homo Sapiens 

<400> 207 
ctggaaccaa agtatfc 

<210> 208 . 

<211> 16 

<212> nBFA 

<213> Homo Sapiens 

<400> 208 
ctggaaccac agtatt 

<210> 209 

<211> 28 

<212> DttA 

<213> Homo Sapiens 

<400> 209 

acagagcaga atctfctcstta ctcagaag 
<210> 210 
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<211> 28 
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<212> DNA 

<213> Homo Sapiens 



<400> 210 

gggafcatgga tacaatfcatt tacccaaa 

<210> 211 

*211> 18 

<212> E15A < 

<213> Homo Sapiens 



<400> 211 
tactgcaaca gtacaafcg 

<210> 212 
<211> 23, 
<212> 

<213> Homo Sapiens 
<400> 212 

tcaacctacc tgggaagatc aaa 

<210> 213 

<211> 23 

<212> DHA 

<213> .Homo Sapiens 

<400> 213 

ggcccfccttt ccttgactat fcca 

<210> 214 

<211> 20' 

<212> DNA 

<213> Homo Sapiens 



<400> 214 

atactgcaafc agfcacaatga 
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<210> 215 

<211> 18 

<212> m& 

<213> Homo Sapiens 

<400> 2X5 
caactaatgc tc'ctctat 

<210> 216 

<211> 31 . 

<212> DNA 

<213> Homo Sapiens 

<400> 21$ 

gctaaacaaa aaaagaaaaa ttacttacca t 

■ <210> 217 

<211> 20 

<212> DMA 

<213> Homo Sapiens 

<400> 217 

tgcgatcacc tggattgatg 

<210> 218 

<211> 29 

<212> DUSK ■[ 

<213> Homo Sapiens 

<400> 218 

tcfctaaagat ttgatttttc tcccafcaaa 

<210> 219 
<211> 24 
<212> UNA 
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<213> Homo Sapiens 

<400> 2X9 

tfcctggtagg acaaatattg gcaa 

<210> ■ 220 

<211> 13 

<212> DKA 

<2X3> Homo Sapiens 

<4Q0> 220 
caactaatgt tccfcctatc 

<210> 221' 

<211> 17 

<212>, DHA 

<213> Homo Sapiens 

<400> 221 
atccaggtga tcacaaa 

■ <210> 222 
<211> 24 

<2i2> m& 

<213> Homo Sapiens 
<400> 222 

tfcctggtagg acaaatattg gcaa 

<210> 223 

<211> 15 

<212> OTA 

<213> Homo Sapiens 

<400> 223 
ccaggtgatc gcaaa 
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<2XQ> 224 

<21X> 31 

<212> DNA 

<213> Homo Sapiens 

<400> 224 

gctaaacaaa aaaagaaaaa ttacttacca t 

<2XD> 225 

<211> 17 . 

<212> X>N& ■ 

<213> Homo Sapiens 

<4Q0> -225 
taactcttca aaaagac 

<210> 226 
<211> 33 
<212> -DNA 

<213> Homo Sapiens \ 
<400> 226: 

catgfccagtg fcgattfctatt tfcatctatgt etc 

<210> 227 

<211> 25 

<212> DSTA 

<213> Homo Sapiens 

<40O> 227 

gagaaggttg atgcttttga agaae 

<210> 226 

<211> 18 

<212> DJTA 

<213> Homo Sapiens 
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<400> 228 
afca aettttc aaaa a <j ac 

<210> 229 ' 

<2X1> 24 
<212> 

<2X3> Homo Sapiaas 



<40Q> 229 

ttfcgaaaagt tatatctact taca 



24 

<210> 230 ■ ■ 

<212> 25 

<212 > DHa 

<223> Eoao Sapiens 

<4D0> 230 

tgatsettfct gaagaacgac afeaaa 

25 

<210> 231 
<211> 24 
<212> 

<2X3> Homo Sapiens 



<4O0> 

■ fctttfcgaaaa gtfcatgtcfca ctta 



24 

<2X0> 232 



<211> 25 

<212> BSA 

<213> Hbmo Sapiens 



<400> 232 

tcctcaaaaa cat gtC a gfcg fcgafct 



25 



